PREFACE


Two years ago the University of Iowa Studies in Psychology 
included an integrated volume of psychological investigations 
envisaging the field of plastic and graphic arts. The present 
volume gives continuity to the progressive researches in that 
division of research. 

The test of any new endeavor in psychology is its productivity 
on a high qualitative plane. Certainly by this criterion Professor 
Norman C. Meier, who has directed all of these studies, deserves 
a great measure of credit. By instilling enthusiasm in his gradu- 
ate students, by planning and articulating the relatively inde- 
pendent investigations in a program of research extended over 
the years, and by circumspectly regarding the major problems 
from a scientific angle in a territory that is full of romanticism 
and humanistic evaluations, he has established his reputation as 
a critical psychologist who is also sympathetically versed in the 
plastic and graphic arts. It is a difficult task to apply qualitative 
distinctions and quantitative scores to a region that naturally 
resists the refinements of science. The reader will therefore 
notice that the attitude of these published researches is not to wipe 
away esthetic appreciation and artistic creation, but to under- 
stand them better from the scientific angle and to learn their 
intricacies within the limits of scientific possibility. 

The Editor of the Series is sincerely grateful to the several 
authors for their willingness to prepare their articles in con- 
formity with the established standards of publication in this 
Series. To Dr. Norman C. Meier the General Editor is espe- 
cially indebted for preliminary editing of the papers and for many 
suggestions looking toward the improvement of our Iowa Studies. 


CHRISTIAN A. RUCKMICK 
Iowa City, Iowa


15 April, 1936 

INTRODUCTION
by 


NORMAN CHARLES MEIER 


The eight studies of this volume are part of a research program
known as “Genetic Studies in Artistic Capacity” sponsored
by the Carnegie Foundation for the Advancement of
Teaching and supported by funds of the Carnegie Corporation 
of New York. Acknowledgment is made to Dr. Frederick P. 
Keppel, president, and to the late President Henry Suzzallo of 
the Foundation and to President Walter A. Jessup, his successor. 
Without such interest and support these studies could not have 
been made. The program is under the direction of the writer. 

The studies follow those of a previously published volume,¹
some being directly related to earlier studies reported therein. A 
third volume will contain further studies now in progress. 

The research program was projected as a series of investiga- 
tions, each contributing some advance toward one of the follow- 
ing problems: the nature and variation in the general population 
of artistic capacity and art ability (productivity) ; the conditions 
and limitations of learning in art education during the grade 
school and high school periods; the nature of the appreciation- 
process and conditions affecting its functioning; and new 
approaches to the study of creative mind. The procedure has 
involved a fruitful merging of psychological, art and educational 
viewpoints and personnel, utilizing theoretical, experimental, 
statistical and case study techniques. Theoretical discussion and 
planning is usually followed by experimental developments. 
When the latter reach a point where apparent findings are in 
sight, the procedure leads to application to larger numbers of 
subjects in the schools. Co-operation has been secured from the 
public schools of Cedar Rapids, Villisca, Grinnell, West Liberty, 


1 Studies in the Psychology of Art, Psychol. Monog. 45, 1933, No. 1. 

Washington, and Des Moines, Iowa; Lincoln and Cozad,
Nebraska; and Milwaukee, Wisconsin. Grateful acknowledg- 
ment is made to the superintendents, principals and teachers and 
particularly to Estelle Hayden, the Director of Art, of Des 
Moines and to Alfred Pelikan of Milwaukee, since access to 
subjects such as they provided was an essential condition for the 
development of the program. 

All of the investigations used children as subjects. Conse- 
quently all techniques and materials had to be adjusted or adapted 
to comprehension at the child-level. This the reader should bear 
constantly in mind. 

The first study by Voss was projected with a view toward 
discovering how the child in the grades secured whatever measure 
of art appreciation he may at that time possess. It is conceivable 
that appreciation might be simply absorbed from the environment 
without instruction. It is an educational tenet that contact with 
pictures or reproductions of famous paintings is instrumental in 
inculcating appreciation. Furthermore, various conditions affect 
the process, such as participation either in the active study of 
pictures, or in producing some kind of art work. The question,
furthermore, as to what extent appreciation is possible in the 
general population is something not at present known. These 
are some of the questions which the Voss investigation attempted 
to study. Such considerations are obviously basic to any intelli- 
gent plan of art education and constitute a neglected aspect of 
educational psychology. 

The Walton study of empathic responses in children carries 
the studies made by Lundholm, Poffenberger, Guilford, Roy and 
others down to the child-level with simplified materials adapted 
to the child-range of response, in order to discover whether or 
not empathic response does occur in children and the extent to 
which it becomes significant at each age-level. While empathic 
response is a special phase of appreciation, it is believed by a 
number of psychologists and art educators to be a basic consid- 
eration in disclosing artistic sensitivity. There is, undoubtedly, 
value in the study of the emotional suggestion of line, color, and 
facial expression as elements in art; and the Walton study shows 
that it has some differentiating possibilities. 

The second Voss study is included in this volume as a contribution
toward the reliability and validity of a modified form
of the McAdory test applied to the lower grade levels. Heretofore,
the Meier-Seashore, the McAdory, and other art tests have
been considered primarily as instruments for use on adults but
of doubtful value with children below the age of ten. In making
certain changes in the manner in which materials are
presented to the child, Voss has adapted the test for use at
lower ages.

The study by Tiebout on the measurement of quality in chil- 
dren’s paintings by the scale method describes the procedure 
followed in the construction of a scale for experimental purposes 
in connection with a number of current studies. The necessity
of having some objective basis for evaluating children’s productive
ability, and of providing a set of measurements that might
then be correlated with intelligence, esthetic sensitivity, progress
in art ability from year to year, and other considerations, led to 
the construction of this scale through the co-operation of Des 
Moines schools and the co-operation also of 14 art educators in 
various parts of the country who served as judges. 

The study of the relationship of artistic ability and general 
intelligence was approached through three groups of subjects. 
The productive ability of approximately 100 subjects in each of 
the seven grades of a typically ‘average’ Des Moines school was 
ascertained by means of the scale and then correlated with gen- 
eral intelligence. The second group of subjects were selected 
students from the Milwaukee schools that were designated by 
Director Pelikan, with the assistance of supervisors and teachers, 
as outstanding in art activities. The third group consisted 
entirely of prominent American artists selected from a list of 
5,500 on the basis of general reputation, honors won, and gen- 
eral character of achievement. The interesting finding of this 
study is that while results are variable from grade to grade in 
the grade schools, the average of intelligence for the high school 
groups is definitely above average, while the average intelligence 
of the mature artists was found to be 119, at the dividing point
between superior and very superior intelligence. 

The Saunders study was an attempt to determine experimentally
whether artistic ability is more or less fixed at birth or early
in life; or, if, on the other hand, it is possible to change it by 
appropriate training. This touches on two widespread beliefs, 
namely, on the one hand that art ability is rare and more or less 
unpredictable and hence can only be known by its emergence. The 
other belief holds that any one can be taught to draw. To make 
the problem decisive, Saunders undertook to use as subjects a 
small group of grade school children who had never shown any 
interest or aptitude in drawing or painting. That he was 
able to show results which are indisputable and contrary to the 
expectations of some viewpoints, is remarkable. 

In contrast to this experiment is the case study of Loran Lock- 
hart, a boy practically blind from birth, for whom a fair degree 
of vision was made possible at the age of seven by cataract 
operations. While cataract cases in young children are not in 
themselves rare, a case of operative recovery accompanied by 
definite drawing ability, even under such a handicap as unsatis- 
factory vision, was believed to be sufficiently rare to warrant 
investigation. This case was one in which the factor of visible 
environment approached zero. Unlike the preceding studies the 
conclusion of this study points to a theory rather than to 
experimental findings. 

The final article is a description of a new approach to the study 
of creative artistic intelligence. It describes an apparatus or 
device for fabricating esthetic organizations and controlling
these at will. It is believed,—since almost any condition of light, 
mass, color, form, and organization can be had by this means, 
making it possible to duplicate colorings, moods or emotional 
quality,—that it would provide a flexible device of unlimited pos- 
sibilities for creative expression and for the construction of 
experimental compositions for various purposes. This device
is now being used by William McCloy in an investigation of the 
creative imagination of children and will be used in other studies 
for various purposes. These studies will be described at a
later date. 

The studies by Walton and Tiebout are postdoctorate investigations;
those by Voss were both done as separate parts of her
doctorate study, as was the investigation of Saunders. The
construction of the scale involved the participation of various 
research assistants and most of the ideas in all studies were 
benefited by criticisms and constructive consideration offered by 
the Psychology of Art Seminar group. 

The writer wishes to acknowledge further a contribution to the 
cost of publishing the volume made by the Conference for Re- 
search in Art of the American Federation of Arts and the Car- 
negie Corporation and also some partial aid for two of the studies 
by the Spelman Fund through co-operative arrangements with 
the Iowa Child Welfare Research Station. 

Acknowledgment is also made to Professor Christian A. Ruck- 
mick who, as general editor of the Iowa Studies, rendered 
service in the preparation of the volume for publication. 

A STUDY OF CONDITIONS AFFECTING THE FUNCTIONING
OF THE ART APPRECIATION PROCESS
AT THE CHILD-LEVEL¹


by


MILDRED DOW VOSS


I. Introduction. Two important forms of appreciative ability
in graphic art are the ability of the individual to analyze the
manner in which a composition has been esthetically organized,
and the ability to recognize the effect of variations in the esthetic
organization of a picture upon its relative merit. The former
type of appreciative ability will be designated as esthetic
analysis, the latter as esthetic judgment.

This investigation concerns itself with the effect of two 
factors, direction and practice, upon the development of esthetic 
analysis and judgment in elementary school children, and upon 
the development of ability to produce an esthetically meritorious 
composition.²

It would seem logical to suppose that esthetic analysis and 
judgment can best be developed through cultivation of a knowl- 
edge of the criteria upon which esthetic merit is based. These 


1 An investigation in the research program, Genetic Studies in Artistic
Capacity, directed by Dr. Norman C. Meier and sponsored by the Carnegie 
Foundation for the Advancement of Teaching with financial aid from the 
Carnegie Corporation. This study received aid also from the Spelman Fund 
through co-operative arrangements with the Iowa Child Welfare Research 
Station. 

2 The term ‘direction’ is adopted from Maier and Reninger who define it in
the following manner: “What the difficulty is seen to be will determine the way
the problem is attacked. This way of starting out on a problem is termed 
direction.” (9, p. 30.) A picture may be considered a substitute-object or an 
esthetic organization. Upon the adoption of one or the other of these view- 
points depends the entire subsequent character of the analytical or evaluative 
response to a picture. In this investigation the term ‘practice’ is used in the 
sense of continued experience over a period of time in dealing with problems 
of the type that are herein described. 

criteria are the presence of relationships between the various
art elements (such as form and color), which are described by
such terms as balance, rhythm, sequence, and harmony. All of
these relationships can be subsumed under the larger term ‘principles
of art’. A picture possesses esthetic merit when each
element is related to itself and to the other elements in such 
a manner that the composition would be less satisfying if the 
existing relationships were added to, subtracted from, or altered 
in organization (unity). 

There are certain ways of looking at a picture which are not 
based upon these criteria. One may look at a picture to determine
what is portrayed (subject-recognition); how well the
portrayal resembles the actual model (photographic representation);
to what extent the object interests the observer (subject-interest);
what message is conveyed by the situation in which
the object is portrayed (literary interest); or how well the setting
of the object conforms to those conditions usually accompanying
its occurrence (historical or naturalistic accuracy). All of
these ways of looking at a picture are characterized by a common 
approach. The picture is perceived as a representation of an 
object which is henceforth considered in the same manner as the 
actual object would be considered were it present—that is, the 
picture becomes a substitute-object. This attitude is opposed to
the approach which considers the object portrayed as more or 
less incidental and regards as important the degree to which the 
picture is a unified organization of pictorial elements. From this 
character of opposition it would seem to follow that these two 
ways of looking at a picture are mutually exclusive. As long 
as the picture is understood as a substitute-object, the esthetic
organization is disregarded. When a picture is understood as
an esthetic organization, the subject-matter is secondary.

Accordingly one of the principal obstacles in the development 
of esthetic analysis and judgment occurs at the beginning of 
the process. A picture must be perceived in terms of an organi- 
zation of esthetic elements in order that development may occur. 
Attitudes which inhibit such an understanding of the problem 
must be broken down. Specifically, this investigation proposes
to test the hypothesis that an important step in the development 
of esthetic analysis and judgment consists of the revision of
attitudes in which good art is perceived as portrayal of interest- 
ing subject-matter, photographic representation, evidence of tech- 
nical ability, a peg upon which to hang associated content, or a 
story-telling device; and the substitution of the attitude that 
good art involves selection, elimination, and arrangement 
according to principles such as unity, harmony, and rhythm. 

Once the comprehension of art as an organization of pictorial 
elements is established, the factor of practice, or experience in 
the perception of esthetically significant relationships, becomes
important. As the principles of art are generalized relationships 
which may be exhibited in many concrete forms, adequate under- 
standing requires abstraction of a common relationship from 
many examples. With repetition of similar types of problems, 
relationships which are at first perceived with difficulty are recog- 
nized more easily upon subsequent reappearance. With mastery 
of this relationship the attention may be turned to relationships 
which are as yet difficult to perceive. Accordingly there is an 
increase in the number, and probably in the quality, of the rela- 
tionships which can be identified. Eventually more or less com- 
plete analysis requires that each element be related through a 
system of increasingly inclusive relationships to the whole of 
which it is a part. 

With the attainment of at least a partial understanding of the 
principles, the basis for the development of esthetic judgment 
has been established. Upon the presentation of two or more 
methods of creating an esthetic organization, each method must 
be scrutinized as to its degree of success in the attainment of 
unity. The respects in which the organizations vary and the 
influence of each variation upon the total effect must be noted, 
and judgment made on the basis of the appropriateness of the 
variation to the total form of the picture. With increased expe- 
rience, variations which are at first with difficulty recognized 
as enhancing or destroying the unity of the composition are, upon 
repetition of similar problems, recognized with greater ease so 
that the attention may be turned to problems not yet solved. 
With increase in ability to evaluate the unifying qualities of more
types of variations there occurs greater accuracy of judgment. 
The second hypothesis of this investigation is that practice in 
the recognition of the principles of art is an important factor 
in the development of esthetic judgment.

II. The effect of adequate direction upon the development of 
esthetic judgment. Experiment I. Experiments I and II 
attempted to isolate the effect upon esthetic judgment of the
manner in which the problem of looking at pictures was 
approached. Experiment I undertook to test the assumption 
that practice in the judgment of differing degrees of merit can- 
not bring about a significant increase in the development of 
esthetic judgment without an initial comprehension of the picture 
as an organization of pictorial elements. 

The subjects of Experiment I were 101 children in the second, 
third, and fourth grades of the public schools of Villisca, Iowa.³
The children were divided into two matched groups on the basis 
of intelligence (IQ), mental age (MA), chronological age (CA), 
grade, sex, and esthetic judgment score. IQ and MA were 
determined by the Kuhlmann-Anderson Group Intelligence Test, 
4th edition. Esthetic judgment was measured by a modified
form of the McAdory Art Test in which judgment was required
only between the best and worst degrees of merit of each exercise.⁴
Table I presents the mean scores and standard deviations
for each of these measures for each group. 

The experimental group, Group A, was assembled as a class 
for 30 min. a day, three days a week for ten weeks. These 
ten weeks were distributed over a period of twenty weeks through 
which two weeks of classes alternated with two weeks of rest. 
Approximately 140 paired or multiple-choice illustrations were 
presented in which two or more alternative organizations were 
shown simultaneously. These alternative organizations were 
similar in every respect except that the arrangement of objects 


3 Acknowledgment is made to Mr. H. E. Dow, Superintendent of Villisca 
Public Schools, and to the principal and teachers co-operating in this study. 

4 For a detailed discussion of the administrative procedure, validity, and
reliability of this form of the test, refer to the article in this volume entitled 
The Validity and Reliability of a Modified Form of the McAdory Art Test for 
Use at Lower Grade Levels. 

TABLE I. Description of Groups A and B in Experiments I and II⁵

              No. of subjects            Art score         IQ                MA                CA
Grade  Group  T        F        M        Mean     S.D.     Mean     S.D.     Mean     S.D.     Mean     S.D.
II     A      [illeg.] [illeg.] [illeg.] 33.9     7.8      102.6    8.1      [illeg.] 6.2      89.2     6.6
       B      [illeg.] [illeg.] [illeg.] 33.6     5.7      105.5    8.7      [illeg.] [illeg.] [illeg.] 7.0
III    A      [illeg.] [illeg.] [illeg.] 35.5     6.4      [illeg.] 27.4     102.1    12.1     104.8    12.1
       B      20       10       10       35.0     4.2      102.0    14.5     102.5    12.4     [illeg.] 6.5
IV     A      [illeg.] [illeg.] [illeg.] 36.4     8.6      100.3    10.8     115.8    10.5     [illeg.] 6.5
       B      [illeg.] [illeg.] [illeg.] 37.8     6.6      101.5    [illeg.] 115.4    16.4     114.4    7.3

or the form of the objects was changed so that in one member
of the pair a principle was exemplified while in the other member
the principle was violated. The general form of a number of
these illustrations was adapted by the experimenter from illustrations
in magazines, newspapers, and children’s books.⁶ The
remainder of the exercises used were selected from the Whitford
Art Appreciation materials,⁷ from a preliminary version of the
Christensen Art Test,⁸ and from the Jacobson Costume Design
Booklet.⁹ In the experimental period each child was presented
with a mimeographed copy of the exercise. The experimenter 
pointed out the respect in which the exercises differed, taking 
care, however, not to suggest the esthetic principle involved. 
Each child then checked the picture which he would like to have 
returned to him at the conclusion of the experiment. Plate I
illustrates the form in which the exercises were presented. 

At the conclusion of the experiment both experimental and 
control groups were retested by a repetition of the McAdory 
Art Test. To determine if either group had gained significantly 
in mean esthetic judgment score during the experiment, the 


5 The IQ, MA, and CA scores are in terms of months.
6 A copy of these exercises is on file at the Psychology of Art division at the
University of Iowa.
7 Whitford, W. G. An introduction to art education. N. Y., Appleton,
1929, plates I, II, III, and IV, exercises 2, 4, 5, 9, and 11.
8 Christensen, E. C. A test in art appreciation (revision of March, 1927),
exercises 3, 4, 6, 8, 10, 11, 12, 13, 16, 18, 20, 21, 23, 24, 25, 26, 28, 36, 38, 39, 40,
41, 46, 47, 48, 49, 52, 53, 54, 56, 58, 59, 60, 62, 65, 66, 67, 70, 71, 75, 76, 77, 83,
89, 90, 91, 92, 93, 98, 99, 100 (Photographic print form).
9 A copy of this booklet is obtainable at the Psychology of Art division at
the University of Iowa. It is described in the article by Jacobson, W. E., An
experimental investigation of the basic esthetic factors in costume design,
Psychol. Monog., 45, 1933, No. 1, 147-184.

following formula was applied to determine the probable error
of the difference between the original and final mean test score
for each group.

(1)  $P.E._{\text{diff.}} = \sqrt{P.E._{M_1}^{2} + P.E._{M_2}^{2} - 2r\,P.E._{M_1}\,P.E._{M_2}}$   (2, p. 286)

$P.E._{M_1}$ and $P.E._{M_2}$ represent respectively the probable error of the mean
scores in the initial and final test for a particular group.

$r$ represents the Pearson product-moment coefficient of correlation
between the initial and final score on the McAdory Art Test for each
subject.

The significance of the difference between the original and final
test scores was determined by the formula

(2)  $\dfrac{D}{P.E._{\text{diff.}}}$   (2, p. 136)

in which D represents the obtained difference between the original
and final mean test scores and the $P.E._{\text{diff.}}$ was obtained as indicated
in formula (1). The chances of a true difference greater
than zero were obtained by reference to a table of probabilities
(2, Table XV).
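
Formulas (1) and (2) can be illustrated with a short sketch. The function names and the numerical values below are hypothetical and are not data from this study. The third function is an assumption about how a table of probabilities of this kind is ordinarily read: it treats the sampling distribution as normal and uses the fact that one probable error equals 0.6745 standard deviations.

```python
import math


def pe_of_difference(pe_initial, pe_final, r):
    """Formula (1): probable error of the difference between two
    correlated means (initial and final test of the same group)."""
    return math.sqrt(pe_initial**2 + pe_final**2 - 2.0 * r * pe_initial * pe_final)


def critical_ratio(d, pe_diff):
    """Formula (2): obtained difference divided by its probable error."""
    return d / pe_diff


def chances_of_true_difference(ratio):
    """Assumed normal-curve reading of D / P.E.diff: the proportion of
    chances that the true difference is greater than zero."""
    z = 0.6745 * ratio  # convert probable-error units to standard-deviation units
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Hypothetical values, chosen only to show the order of the computation.
pe_diff = pe_of_difference(pe_initial=0.9, pe_final=1.0, r=0.6)
ratio = critical_ratio(d=3.0, pe_diff=pe_diff)
print(round(pe_diff, 3), round(ratio, 2), round(chances_of_true_difference(ratio), 4))
```

The correlation term reduces the probable error of the difference when initial and final scores are positively correlated, which is why the same subjects tested twice yield a more sensitive comparison than two unrelated groups.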

To determine if the difference between the means of the 
experimental and control groups on the final test was significant 
the following formulae were applied. As the effect of the 
methods compared resulted in different values of the correlation 
coefficient between initial and final measures for the two groups, 
the following formula devised by Wilks was applied inde- 
pendently to the mean of each sample and the resulting values 
substituted for the probable errors of the means in the usual 
formula for the probable error of the difference.

(3)  $P.E._{M_1} = \dfrac{.6745\,\sigma\,\sqrt{1-r^{2}}}{\sqrt{N}}$

$P.E._{M_1}$ represents the probable error of the means of matched groups.
$r$ is the Pearson product-moment coefficient of correlation between the
measures used as the basis for matching (the initial McAdory Art Test
score) and the measures for which the mean is computed (the final
McAdory Art Test score).

(4)  $P.E._{\text{diff.}} = \sqrt{P.E._{M_1}^{2} + P.E._{M_2}^{2}}$

The results for Experiment I are to be found under that 
heading in Table II. 
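
The comparison of final means by formulas (3) and (4) can be sketched as follows. The sketch assumes that formula (3) has the usual form for matched samples, in which the ordinary probable error of a mean, 0.6745 times the standard deviation divided by the square root of N, is multiplied by the square root of 1 minus r squared. The symbol sigma, the function names, and all numerical values are assumptions introduced for illustration and are not figures from Table II.

```python
import math


def pe_mean_matched(sigma, r, n):
    """Formula (3), as assumed here: probable error of the mean of a
    matched group, where r correlates the matching measure with the
    measure whose mean is computed."""
    return 0.6745 * sigma * math.sqrt(1.0 - r**2) / math.sqrt(n)


def pe_difference(pe_m1, pe_m2):
    """Formula (4): usual probable error of the difference between two
    means, with the matched-group values from formula (3) substituted."""
    return math.sqrt(pe_m1**2 + pe_m2**2)


# Hypothetical final-test statistics for an experimental and a control group.
pe_experimental = pe_mean_matched(sigma=7.0, r=0.55, n=50)
pe_control = pe_mean_matched(sigma=6.5, r=0.70, n=51)
pe_diff = pe_difference(pe_experimental, pe_control)

# Ratio of a hypothetical difference in final means to its probable error.
print(round(pe_diff, 3), round(2.0 / pe_diff, 2))
```

Because each group receives its own value of r, the method allows the two treatments to have altered the initial-final correlation differently, which is the circumstance the text gives for applying the formula to each sample independently.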


These results seem somewhat contradictory. Simple repetition
of the test, which can be construed as practice in judgment
of varying degrees of merit, did bring about a significant
increase in esthetic judgment score from the initial to the final
test in each group, while a considerable amount of practice in
additional exercises failed to result in a significant difference 
between the final scores of the groups. A possible explanation 
of these results might be the greater familiarity on the part of 
the child with the experimental set-up. In the first test the chil- 
dren were for the first time exposed to an individual test situa- 
tion. In the final test, experimental conditions, experimenter, 
and test exercises were familiar to the child. An element of 
competition between groups was also introduced as a motivating 
factor in the final test. It seems probable that the difference 
between final test scores of the two groups is a more valid indi- 
cation of the effect of practice upon an increase in esthetic 
judgment score than the increment between initial and final test 
scores of a single group. This difference in final test scores 
was not significant, a result which would seem to indicate that 
practice in judging between varying degrees of esthetic merit 
without an understanding of the proper criteria to be employed, 
produced through instruction in the principles of art, will not 
produce a significant increase in esthetic judgment. 

Experiment II. To determine if the same experimental exercises
accompanied by a verbal explanation of the principles of
art would produce a significant increase in esthetic judgment,
Experiment II was performed. The same subjects were used 
in this experiment, but with the groups reversed. The new 
experimental group met for five days each week, for three con- 
secutive weeks. The class period was 30 min. in length as before. 
The procedure, except for an abbreviated series of exercises 
(N=100), was the same as in Experiment I with the variation, 
that, after the child had indicated his preference and the slips 
had been collected, the experimenter designated the correct illus- 
tration and explained, in terms suitable to the age of the children, 
the principle of art involved. A typical explanation is presented
with Figure 1.

Fig. 1. Illustration of the type of experimental exercises used in
Experiments I and II


DOMINANCE BY PLACEMENT AT POINT OF RADIATION


Directions for Experiments I and II. In this picture a frog is sitting on a 
big lily leaf. He is sitting in a different place in each picture. Put the letter 
of the picture that you like best by number 16 on your paper. Pass your paper 
to the person back of you. 


Further explanation for Experiment II. You remember that we said there 
must always be a captain of a picture just as there must always be a captain of 
a football or baseball team. The most important thing, or the thing that we 
look at most, in a picture is the captain. In this picture we look at the frog
most so that he must be the captain. But you remember that when lines came 
to a point they made a captain too. Now in this lily pad there are lines that 
lead to a point. In picture A we have one captain here where the lines meet 
and another captain here at the frog. What happens if we have a team with 
two captains? It doesn’t get along very well, because one captain often wants 
to do one thing while the other captain wants to do something else. Until one 
captain gives in, the team can’t play ball. 

Suppose we make Jack captain of the baseball team of this school. The 
teams from all of the schools meet and make a town-team. Jack is elected 
captain of the town-team so that he is captain of the school-team and captain 
of the town-team both. He is a more important captain now than if he were 
just captain of the school-team. In picture B, the frog is placed where the 
lines come to a point. He has been elected captain of the lines and the whole 
picture both so that he becomes a more important captain. The picture is better 
just as a team is better the more important the captain is. B is the better
picture because it has one important captain rather than two captains that don’t
get along together.


At the conclusion of this experiment both groups were retested 
and the results treated by the same statistical methods as in 
Experiment I. These results, presented in Table II, indicate 
that there was a significant increase in score on the McAdory 
Art Test in the experimental group but that the increase in test 
score for the control group was not significant. Furthermore, 
the difference in final score of experimental and control groups 
was significantly in favor of the experimental group. From 
these results the conclusion seems to be justified that it is neces- 
sary that the child comprehend the picture as an organization 
involving those relationships which have been designated as prin- 
ciples of art before experience in judging between varying 
degrees of esthetic merit will produce a significant increase in
esthetic judgment.

III. The effect of number of exercises judged upon gain in 
esthetic judgment score. During the course of Experiment II 
illness necessitated the absence of a number of children during 
portions of the experiment. Accordingly the number of exercises 
judged by each child varied considerably. The procedure in this 
experiment consisted of the presentation of several exercises at 
each class period, each exercise differing in the nature of the 
principle exemplified. During the course of one lesson, there- 
fore, a number of principles were explained and succeeding 
instruction consisted, for the most part, in the application of a 
relatively few principles to pictures of varying content. Thus,
a child might attend the class only a few days and acquire a 
knowledge of the general attitude and fundamental principles 
being employed. Constant attendance at the class, however, 
resulted in a much greater amount of practice in the application 
of this core of ideas to a great variety of examples. Advantage 
was taken of the large variation in number of days’ attendance 
to determine if the number of exercises checked and explained 
bore any relation to the amount of gain in esthetic judgment 
score. 

On the basis of the general shape of the distribution curve 
of exercises checked, the children were divided into two groups, 
those having checked more than 90 exercises and those having 
checked fewer than 90 exercises. The average amount of gain 
on the McAdory Art Test for each group was then determined. 
The results are shown in Table III. 


TABLE III. A comparison of the group checking more than 90 exercises
and the group checking fewer than 90 exercises in amount
of gain in Experiment II


                      Mean number    Mean initial
           No. of     of exercises   McAdory        Mean    Mean    Mean    Mean
Group      subjects   checked        score          IQ      MA      CA      gain
90-100     14         95.7           35.5           102.5   106.9   105.0   6.09
30-90      11         53.5           35.3           97.5    97.5    95.1    2.36
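
As a quick arithmetic check on Table III, the mean gain of the 90-100 group can be expressed as a percentage of the mean gain of the 30-90 group. The sketch below uses only the two mean gains printed in the table; the function and variable names are illustrative and do not come from the study.

```python
# Mean gains on the McAdory Art Test, copied from Table III.
gain_90_100 = 6.09  # group checking more than 90 exercises
gain_30_90 = 2.36   # group checking fewer than 90 exercises


def percent_of(a: float, b: float) -> float:
    """Return a expressed as a percentage of b."""
    return 100.0 * a / b


ratio_pct = percent_of(gain_90_100, gain_30_90)
print(f"{ratio_pct:.0f} per cent")
```

The ratio is a comparison of group means only; it says nothing about whether the difference is statistically significant.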


The gain in the group checking over 90 exercises was 258
per cent that of the group checking fewer than 90 exercises.
However, it is evident that the IQ, MA, and CA scores were
considerably in favor of the group making the greater gain. In
order to determine if these factors were affecting this gain in
score, the children in the two groups were paired according
to each measure and the difference in gain recomputed. For
example, a child having an IQ of 104 in the 90 to 100
exercise-group was matched to a child having an IQ of 104 in the 30 to
90 exercise-group. Children were paired only if the measures
in question were within one point of each other so that the
averages and standard deviations of each group were identical.
The two new groups thus obtained were then compared for gain
in esthetic judgment score. The same procedure was followed
with MA, CA, and initial McAdory Art Test score. These
results are presented in Table IV.
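
The pairing procedure described above can be sketched as follows. The text does not say how a pair was chosen when several children qualified, so the greedy nearest-match rule here is an assumption, and the records are hypothetical values invented for illustration, not data from the study.

```python
from statistics import mean


def pair_within(high, low, key, tol=1.0):
    """Greedily pair children from two groups whose `key` values differ by at most `tol`.

    Each child is used at most once. Returns a list of (high_child, low_child) pairs.
    """
    pairs = []
    unused = list(low)
    for h in sorted(high, key=lambda c: c[key]):
        # Closest still-unpaired child in the other group, if any remain.
        best = min(unused, key=lambda c: abs(c[key] - h[key]), default=None)
        if best is not None and abs(best[key] - h[key]) <= tol:
            pairs.append((h, best))
            unused.remove(best)
    return pairs


# Hypothetical records (NOT the study's data): IQ and gain in test score.
high_group = [{"iq": 104, "gain": 7}, {"iq": 110, "gain": 5}, {"iq": 98, "gain": 6}]
low_group = [{"iq": 104, "gain": 3}, {"iq": 97, "gain": 2}, {"iq": 120, "gain": 4}]

pairs = pair_within(high_group, low_group, key="iq")
# Difference in mean gain, computed on the matched children only.
diff = mean(h["gain"] for h, _ in pairs) - mean(l["gain"] for _, l in pairs)
print(len(pairs), diff)
```

Children with no partner within one point are dropped, which is why the matched groups are smaller than the original groups.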


TABLE IV. A comparison of the group checking more than 90 exercises and the
group checking fewer than 90 exercises in amount of gain in
Experiment II when the groups were paired according to
MA, CA, IQ, and initial McAdory Art Test score


Groups                90-100 exercise-    30-90 exercise-    Difference
matched for           group               group              in gain
MA                    7.4                 3.7                3.7
CA                    5.9                 1.4                4.5
IQ                    6.4                 2.8                3.4
McAdory Art Test      6.4                 4.4                2.0
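
The differences in Table IV can be recomputed from the two gain columns. The sketch below copies the figures as printed in this text and flags any row whose recomputed difference disagrees with the printed one; the names are illustrative. As printed here, the IQ row does not reproduce (6.4 minus 2.8 is 3.6, not 3.4). Whether that reflects a misprint or an extraction error cannot be determined from this text.

```python
# (gain of 90-100 group, gain of 30-90 group, printed difference), from Table IV.
table_iv = {
    "MA": (7.4, 3.7, 3.7),
    "CA": (5.9, 1.4, 4.5),
    "IQ": (6.4, 2.8, 3.4),
    "McAdory Art Test": (6.4, 4.4, 2.0),
}

mismatches = []
for measure, (high, low, printed) in table_iv.items():
    recomputed = round(high - low, 1)
    if recomputed != printed:
        mismatches.append(measure)
    print(f"{measure:18s} recomputed {recomputed:4.1f}   printed {printed:4.1f}")

print("rows that do not reproduce:", mismatches)
```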


Because of the small number of cases it was not feasible to 
determine the significance of the difference in gains. The fact, 
however, that the gain was substantially in favor of the group 
checking the greater number of exercises no matter in which 
factor the groups were matched seems to justify the conclusion 
that the difference in gain in esthetic judgment score was a
function of the number of exercises judged. 

IV. The effect of similarity of experimental exercises to test 
items upon gain in esthetic judgment score. The conclusions 
from the preceding experiments regarding the functions of 
knowledge of the principles of art and practice in their recog- 
nition might be challenged on the grounds that the experimental 
exercises were so similar to the test items that the increase in 
esthetic judgment score might be attributable to a type of 
indirect coaching upon the test. 

A variation in Experiments I and II afforded a chance to test 
the specificity of gain on the test items to the type of exercises 
used in the experimental series. The McAdory Art Test is 
divided into three sub-tests, Form, Notan, and Color, for each 
of which a separate score may be derived. Form includes exer- 
cises dealing with the shape of objects; Notan, with the distri- 
bution of light and dark; and Color, with the harmonies and 
accents of color combination. In Experiments I and II, in addi- 
tion to the McAdory Art Test, two supplementary tests were 
administered: (1) the Whorley Unity Scale of Landscape
Arrangements, the following scale values being presented as
paired comparisons, values one and seven, two and six, three and
five, for each of the two forms of the Bird-bath and Entrance
Scales (17); and (2) the Lewerenz Test of Fundamental Abilities
of Visual Art, Part I, Test 1, Recognition of Proportion (7).¹⁰

Only three exercises in the training series of Experiments I
and II involved notan or color, while a considerable number of 
exercises dealt with shape, proportion, and the organization of a 
picture for the production of unity. The principles of art, how- 
ever, apply to all of the elements enumerated. To determine 
if a gain had occurred only on those test items resembling the 
exercises, the difference between the gains in score of the experi- 
mental and control groups was determined for each of these 
sub-tests. The results are presented in Table V. 

Inspection of Table V reveals that, in Experiment I, although 
the difference in gain on the sub-test, Form, was greatest, the 
difference in gain on the sub-test, Notan, was only slightly 
less. Both of these differences approached significance. In 
Experiment II although the greatest difference in gain occurred 
on the sub-test, Form, the difference in gain on the sub-test, 
Color, as well as on the sub-test, Proportion, approached sig- 
nificance. Thus a gain was demonstrated on two types of test 
items differing in the art element involved from the exercises 
used in the experimental series. The difference in gains on 
the sub-test, Unity, was not significant in either experiment.¹¹

When Experiments III and IV were devised, certain modifica- 
tions in the types of exercises were made to determine the justifi- 
cation of the objection that the gain in test score was due to 
the similarity of items to experimental exercises. 

The exercises used in Experiment III consisted of examples
illustrating only the application of the principle. The child was
required to induce the principle from its application in many
specific examples. These illustrations were drawn from a large
variety of newspaper and magazine sources and exhibited the
principle in widely varying types of subject-matter. It seemed
that these training exercises were sufficiently dissimilar to the
test exercises that any change in McAdory Art Score which
might result from the experimental procedure should not be
attributable to direct or indirect coaching upon the test exercises.

10 As the McAdory Art Test alone was used as a measure of esthetic judgment
in Experiments III and IV, the scores from these tests were not included
in the esthetic judgment scores in Experiments I and II. The results are
presented for these tests separately in Table V.

11 As the mean initial scores on this test were high, to gain significantly, the
child would be required to judge correctly between the items ranked third and
fifth on the scale. The brief experimental procedures were probably not sufficient
to bring about such an increment in ability.

The children used in Experiments III and IV were fifth-grade 
children of the public schools of Grinnell, Iowa, a town of about 
5,000 population. The children were matched according to 
grade, sex, IQ, MA, CA, and McAdory Art Test score as in
Experiments I and II. The standardized four-item form of the 
McAdory Art Test was used instead of the two-item form. 


TABLE VI. Description of Groups A and B, Grade V, in Experiments III and IV


          No. of subjects      Art Score          IQ                MA                CA
Group     T     F     M        Mean    S.D.       Mean    S.D.      Mean    S.D.      Mean    S.D.
A         44    23    21       eS      22.2       16.9    1.7       27      11.3      128.8   8.6
B         41    23    18       99.3    20.3       102.0   11.4      20.0    9.4       124.5   8.4


Table VI presents the mean scores and standard deviations for
each of these measures for each group.

The experimental group, Group A, met for thirty minutes 
daily for five days a week for three weeks as a part of the 
regular grade curriculum. Group B received no training. In 
this experiment a systematic outline of some of the major prin- 
ciples of art was developed. The children were first presented 
with a verbal explanation of the principle. After this explana- 
tion, a number of the illustrations of the type described above 
were presented, showing the application of the principle in 
widely varying types of situations. The experimenter pointed 
out, in each case, how the illustration exemplified the principle. 
After a number of principles had thus been presented in the 
training series a group of illustrations exemplifying one prin- 
ciple was presented and the children were required to abstract 
the principle possessed in common by all of the illustrations. 
The outline of the principles which was repeatedly placed before
the children as a basis for discussion is reproduced in Plate I.
Plate II illustrates the type of exercises used, and the manner 
in which these exercises were combined to exemplify a common 
principle. 

Following the experimental period, both groups were retested 
and the results treated statistically as in Experiments I and II. 
The results, presented in Table II, demonstrated that training 
in ability to recognize esthetic principles in exercises of this
type did increase esthetic judgment significantly. 





PLATE II.


Experiment IV. Experiment IV utilized exercises similar in
type, although different in subject-matter and degree of
complexity, to those used in Experiments I and II. As in the former
experiments, the child was required to find which of two
examples exemplified and which violated the principle, in order to
determine which exercise was esthetically better. Experiment
IV can be considered as a repetition of Experiment II under
more refined conditions. The experiment was conducted in the
fifth grade for which the McAdory Art Test is standardized;
the exercises were drawn from sources which would insure a
more adequate validity; and the explanations of the principles
of art were presented in more systematized form.

Group B became the experimental group in this experiment,
meeting for 30 minutes each day for five days a week for three
weeks as had Group A previously. The children were presented
with the same general explanations of the principle under
consideration as in Experiment III. Group B, however, was
presented with the exercises described above, which approach the
test items much more closely in general character than the
exercises used in Experiment III. A few illustrations were drawn
from magazines and newspapers but the majority of exercises
were taken from Art in Everyday Life (3) by Harriet and
Vetta Goldstein and from the Meier-Seashore Art Judgment
Test (12).¹²


TABLE VII. The significance of the difference between gains on the McAdory
Art Test of Groups A and B in Experiments III and IV


                   Group A                          Group B
Experi-            Mean                             Mean                                  Critical
ment        N      gain    S.D.    P.E.      N      gain    S.D.    P.E.    P.E.diff.     ratio       P
III         44     16.7    18.5    1.88      41     8.9     22.9    2.41    3.06          2.55        95
IV          44     2.0     16.4    1.67      40     8.5     19.5    2.08    2.65          2.45        95

The results, treated statistically as in the previous experiments,
are presented in Table II. These results indicate that there was
a significant increase in mean esthetic judgment score in the 
experimental group but not in the control group. The differ- 
ence between final mean scores of the two groups was not 
significant. However, the initial mean score of the experimental 
group was much lower than that of the control group. Accord- 
ingly, the difference between the mean gains in score made by 
the experimental and control groups on Experiments III and 
IV was computed. The results are presented in Table VII. 


12 The general discussion of the principles as presented in Experiments III 
and IV, and references to the source of each exercise used in Experiment IV 
are on file at the Psychology of Art division, University of Iowa.

These results indicate that, although both groups made a much
larger gain in score on Experiment III, the difference in gain
between the two groups and the significance of the difference
are approximately the same in both experiments.
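
The Experiment III row of Table VII is numerically consistent with the conventional probable-error formulas: P.E. of a mean = 0.6745 × S.D. / √N, P.E. of a difference = √(P.E.₁² + P.E.₂²), and critical ratio = difference in gain / P.E. of the difference. The text does not state which formulas were used, so treating these as the method is an assumption, as is reading the table columns as N, mean gain, S.D., and P.E. for each group. A minimal sketch under those assumptions:

```python
from math import sqrt


def pe_mean(sd: float, n: int) -> float:
    """Probable error of a mean: 0.6745 * S.D. / sqrt(N)."""
    return 0.6745 * sd / sqrt(n)


def pe_diff(pe_a: float, pe_b: float) -> float:
    """Probable error of a difference between two independent means."""
    return sqrt(pe_a ** 2 + pe_b ** 2)


def critical_ratio(gain_a: float, gain_b: float, pe_d: float) -> float:
    """Difference between mean gains divided by its probable error."""
    return abs(gain_a - gain_b) / pe_d


# Experiment III row of Table VII (assumed column reading: N, mean gain, S.D.).
pe_a = pe_mean(18.5, 44)              # Group A
pe_b = pe_mean(22.9, 41)              # Group B
pe_d = pe_diff(pe_a, pe_b)
cr = critical_ratio(16.7, 8.9, pe_d)

print(round(pe_a, 2), round(pe_b, 2), round(pe_d, 2), round(cr, 2))
```

Applied to the Experiment IV row, the same formulas give values close to, but not exactly matching, the printed P.E. of the difference, so the check should be read as approximate.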

It appears, therefore, that although the type of exercises used 
in Experiments III and IV differed considerably, and although 
the exercises used in Experiments II and IV were different in 
subject-matter and degree of complexity, a significant gain was 
demonstrated in esthetic judgment score in each experiment.

In Experiment III the exercises involved all of the elements
tested in the sub-tests of the McAdory Art Test. In general,
the amount of emphasis placed on exercises illustrating each
element was in the order: form, color, and notan. Notan
involved but few exercises. The results, presented in Table V,
indicate that the difference in gains between the experimental
and control groups was in the order: form, notan, and color.

In Experiment IV the exercises placed little emphasis on the 
element of color but much emphasis on the elements, form and 
notan. The difference in gains between experimental and control
groups, however, was greatest for the element, color. 

These results, in conjunction with those secured in Experi- 
ments I and II, indicate that differences approaching significance 
in gains between experimental and control groups did occur on 
sub-tests involving a different element of art than was involved 
in the experimental exercise series. 

From these results, the comparative gain on Experiments III
and IV and the comparative gain on sub-tests, the conclusion
is drawn that there was actually a change in the ability of the
subjects to judge esthetically between the merits of different
pictures and that the increase in esthetic judgment score was
not due to an indirect type of coaching upon the specific type of
exercises utilized in the McAdory Art Test.

V. Conclusions regarding esthetic judgment. The experiments
thus far presented seem to justify the conclusions that
among elementary school children of the second, third, fourth,
and fifth grades (1) practice in judging exercises of varying
degrees of esthetic merit without understanding the principles
of art involved does not produce a significant increase in
esthetic judgment; (2) a knowledge of the principles of art
with practice in the application of these principles to exercises
of varying degrees of esthetic merit will produce a significant
increase in esthetic judgment; (3) although a knowledge of the
principles of art with a small amount of practice in the application
of these principles will produce an increase in esthetic
judgment, it will not produce as great an increase as a knowledge
of the principles of art with a greater amount of practice
in the application of these principles.

VI. The effect of the experimental procedures in Experiments
III and IV upon the development of esthetic analysis. The
experimental program definitely instructed the subjects as to the
manner in which a picture is esthetically organized. It was
desired to determine to what extent the children were able to
understand and profit by this training. Accordingly, each child
was individually questioned according to a predetermined series
of questions which were designed to sample the child’s ability
to discern the esthetic organization of Franz Marc’s Red Horses.
The child was thus questioned before and after each experiment.
These questions were not a test in the standardized sense.
The responses which they provoked should be considered rather
as a sample of the child’s manner of thinking about a picture,
as a painting from a child is considered as a sample of his
production. The reservation should also be made that the type of
esthetic analysis which was called forth by these questions is
far from being a complete analysis of the esthetic organization
of the picture. Any responses which indicated a consideration of
one portion of the picture in relation to the whole were
considered as evidence of esthetic analysis. Thus the questions
may more properly be said to have stimulated responses which
were indicative of a rudimentary esthetic analysis. It is assumed,
however, that esthetic analysis is found in many degrees and
that such responses indicate a low degree of ability to analyze
the esthetic organization of a picture.

The series of questions is presented below: 


This picture is called Red Horses and it was painted by an artist named 
Franz Marc. I am going to ask you some questions to find out if you like the 
picture and why you like it.


1. Do you like the picture? What do you think is good about it? After the
training period, the following question was added: Can you see anything in the
picture that we studied about in class? What we studied about in class may
help you to answer some of the questions.


2. What is good about the way the artist has painted the picture? You know 
we say that a person draws something well. What has the artist painted well? 


3. Do you like the colors he has used or do you wish that he had used 
different colors? Why? 


4. What do you look at most in the picture? What makes it (or them) show 
up or stand out so much? 


5. Why do you suppose the artist has put blue here and here and here (pointing
to water, bush, trees, and valleys) instead of putting blue just down in this
corner?


6. Why did the artist make this such a light green—almost yellow—back of 
the horses instead of making it this dark green up on the hill? 


7. How would you have liked it if the artist had made the horses standing on 
flat ground instead of hills? Would you have liked it as well? 


8. Why did the artist put the horses where he did in the picture? Why didn’t
he want to put the horses farther over on this side in the picture (to the right
side)? Why didn’t the artist want to put them back here on the hills?


9. I wonder why the artist painted the horses standing that way. You see 
they are turned different ways and one is a little higher than the others. 


10. If the artist had left the middle horse out, would you like the picture as 
well? Why? 


Why do you suppose he made the middle horse a different color? 


11. Would you like the picture as well if he had left out this red horse 
(right) and painted only these two horses in the picture? 


12. Why do you suppose he made the bush blue instead of green? How 
would you have liked it if it had been green? 


13. How does the picture make you feel? Why? (If no response was given, 
the following question was added.) Does it make you feel sad or does it make 
you feel some other way? How is it different from a sad picture? 


14. (After training) Can you find anything else in the picture that we talked 
about in class? 


The responses to these questions were analyzed in the follow- 
ing manner. The entire series of responses was examined and 
typical answers arranged in a tentative classificatory system. 
Each paper was read with one type of response, such as balance, 
in mind. Responses classifiable under this head were recorded 
in such form that simple counting would give a final score for 
the groups or sub-groups for which a score was desired, and so 
that quick reference could be made to the original record from 
which the response was taken. The first few responses of a 
sample record are recorded below: 


                                        CONTRAST IN VALUE FOR EMPHASIS
COOPER SCHOOL   CC Group   Test III     DARK ON LIGHT—LIGHT ON DARK

1. They are dark and the lighter color makes them show up more (6).
Dominance of dark on light, horse’s head (14).


2. I suppose the horses show up more. They’re dark and it’s light (6). 


X or C indicates that the group is the experimental or control group. I, II,
or III indicates the number of the test in the testing series. The final phrase in
the heading characterizes the type of response enumerated below. The number
at the beginning of the response is the code number for the individual in the
group. The number in parentheses following the response indicates the question
to which it is the answer.


The above process was continued until all of the responses 
had been classified. Those responses which did not fall under 
any of the classifications originally devised were classified under 
headings which seemed applicable. 
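
The record-keeping scheme described above, in which each classified response carries its group, test number, child code, and question number so that simple counting yields a score and the original record can be traced, might be sketched as follows. The field names and the `tally` helper are illustrative assumptions. The first two entries paraphrase the sample record and are assigned to the control group on the assumption that the garbled group letter in its heading is C; the third entry is hypothetical.

```python
from collections import Counter
from typing import NamedTuple


class Response(NamedTuple):
    group: str      # "X" = experimental, "C" = control
    test: str       # "I", "II", or "III": position in the testing series
    category: str   # heading under which the response was classified
    child: int      # code number of the child within the group
    question: int   # number of the question that drew the response
    text: str


records = [
    Response("C", "III", "Contrast in value for emphasis", 1, 6,
             "They are dark and the lighter color makes them show up more"),
    Response("C", "III", "Contrast in value for emphasis", 2, 6,
             "I suppose the horses show up more. They're dark and it's light"),
    # Hypothetical entry, added only to show a second group and category.
    Response("X", "III", "Balance", 3, 8, "The picture wouldn't balance right"),
]


def tally(rows):
    """Count responses per (group, test, category)."""
    return Counter((r.group, r.test, r.category) for r in rows)


counts = tally(records)
for key, n in sorted(counts.items()):
    print(key, n)
```

Because every row keeps the child code and question number, any count can be traced back to the individual answers that produced it.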

These record sheets were presented to a group of ten judges
who were asked to read them, determining whether the choice of the
heading characterizing the response was correct and checking
responses which they considered improperly classified. These
judges were also asked to write their responses to the same questions
asked the children. On the basis of these answers and comments,
the classification of the responses was somewhat revised.
Each classification was grouped under one of two heads,
esthetic and non-esthetic. Non-esthetic responses included
unclassifiable responses, responses indicating an incorrect esthetic
response, responses in which the general attitude exhibited
seemed to refer to the picture as a substitute-object (naturalistic
representation), and esthetic responses which were so incomplete
as to make them meaningless as an indication of the child’s
knowledge of the principles of art. Esthetic responses included
those in which there seemed to be a partial or complete realization
of the picture as an organization of formal relationships.
The final classificatory system is presented below:


Categories for the Classification of Esthetic Analysis Responses


A. Non-esthetic. Responses which are unclassifiable, which 
are incorrect esthetically, or which refer to the subject
matter of the picture rather than to its esthetic organization. 


Unclassifiable responses. Ambiguous character, not assignable to any category.
Example: “Some pictures show all hills, I have an idea.”

Incorrect responses. Affirmative answer to a question which involves an
undesirable change in the composition. Example: “It would look just as well
with blue just in this corner.”

Interest in subject-matter. Interest in the object portrayed as an object, in
the activities of the object, and in the setting of these activities in time and
place. Object not considered as a part of the pictorial organization. Examples:
“Yes. I like stories about horses in pictures.” “The horses look like they
are going to be captured.” “Water. The horses could drink it.” “Looks
about like sunset.”

Enumeration. Naming over or description of objects in the picture without
further elaboration. Examples: “Rocks stick up more along here.” “Hills.
Horses. Bushes.”

Naturalism. Desire for photographic realism. Examples: “Horses aren’t
very pretty for their redness. Sort of tan better for horses.” “Wanted them
(horses) that way. Saw them that way.” “Horses don’t stand straight all
the time. These look natural.”

Logical responses. Adequate from a reasoning but not from an esthetic
standpoint. Examples: “Shows it’s warm because the grass is green.” “Bush
is blue. Flowers maybe.” “That’s where the horses ate off of it (grass).”

Drawing. Applications of what has been learned in the drawing class dealing
with proportion, naturalistic representation, and skill in handling the media, but
failure to relate the elements to the composition as a whole. Examples: “Way
he drew the horses. You can’t always get the right proportions.” “Colors
don’t run together. It’s not messy.” “Didn’t take too many pains painting it.
He just sketched it.” “It would be harder to draw if the horses were up on
the hill.”

Preference for object variety. Preference for variety of ways of portraying
an object without reference to esthetic relationships. Examples: “So you
could see the back of the horses.” “Wanted to get all three of them. This
horse may be half-way in, half-way out.” “Horses’ shapes. These two looking
at you. Others not. Gives a better view of his neck.”

Generalities. Indicative of an evaluation but not indicative of ability to
analyze the basis for that evaluation. Examples: “That’s pretty.” “Horses
arranged nice.” “Probably liked it better.” “Picked good colors for the sky
and ground.”

Stereotyped phrases—color. Phrases of common occurrence indicative of a
preference for certain color combinations which do not evidence a real
comprehension of the problems involved in such combinations. Examples: “The
colors are blended in.” “Look like they match.” “Color goes good together.”

Preference for the unusual. Preference for a subject or technique because it
differs from that usually found. Examples: “Bushes. Look so unordinary.”
“Purple in the corner. Don’t see so much of that in pictures.” “So many
pictures have level ground.”

Personal associations. Projection of the observer within the frame of the
picture as participating in the activity portrayed. Examples: “Be on a horse’s
back riding. Never been around a whole lot of hills. Would be on a lot of
hills.” “Like to get outside and play.”

Sensuous enjoyment of color. Enjoyment of a particular color for its own
sake without reference to the remainder of the composition. Examples: “Like
the background. Colors he colored.” “Yes. The red horses. My favorite
color.” “Pretty. Like light colors better than dark.”

Perception of movement. Perception of the portrayed movement of an object
but not movement inherent in the pictorial relationship (e.g., rhythm).
Examples: “Like the action of the horses. The way they are standing.”
“Like the way it’s painted. Like how the horses are kind of stamping around.”

Perception of texture. Identification of texture but failure to relate it to the
remainder of the composition. Examples: “Tails. Bushy.” “Horses smooth.”
“Texture. Stream rough.”


B. Esthetic. Responses indicative of (1) the arrangement
of objects within the picture space to facilitate a unified 
impression; (2) the relative emphasis placed on different 
objects to facilitate grouping and sub-grouping within the 
picture; (3) the organization of the picture in the third 
dimension; (4) the selection of colors, forms, and values 
related to the whole (harmony); (5) the employment of 
sufficient contrast within the unitary organization to 
increase the interest of the picture (variety); (6) the use 
of pictorial elements to express mood. 


Space filling. Realization that each portion of the picture space must enhance
the total composition. Examples: “To fill the picture out more.” “It makes
a difference in the landscape. If nothing in between have to have another hill
or something.” “It would leave that space empty. Wouldn’t look good.”

Distribution in space. Realization of the necessity of proper distribution of
elements within the picture space. Examples: “Too crowded (if on one
side).” “Don’t have all just up together in one place.” “Don’t look good to
have too much in a picture.” “Too small for the horses. Have to make it a
bigger picture.”

Grouping. Unification by placement of shapes within groups. Examples:
“Prettier in a group than if all over the picture.” “Positions. If apart be
three captains. If together just one.”

Pattern of color. Balanced distribution of color over the entire picture space.
Examples: “Look funny, blue all in one place.” “Looks nicer to have them
scattered around like a patchwork quilt.”

Repetition. Relating different portions of the picture by use of color, shape,
or value in each portion. Examples: “Color alike. Blue up here and down
here.” “No. It seems if two of one thing, want to have it the same kind.”
“Color would not look like the other side.”

Sequence. Unification of a number of objects by progression in size.
Examples: “Sequence. Low head, higher, top one way up high.” “Like
these. They make a sequence of color. Light, dark, darker.” “Sequence.
The hills. They get bigger as they go back.”

Radiation. Unification of a number of objects into a larger shape
characterized by radiating form. Examples: “The horses make a kind of radiation.
They go out different ways.” “They look better. One head is going in each
direction.”

Geometrical form. Unification of a number of objects into a larger form
such as a square, triangle, circle, S-curve, etc. Examples: “Makes a triangle—
horses.” “Mountain a circle.” “Horses make a circle.” “Bush. Like it’s
round. Hills are round.” “S-curve. Head of horse down through neck of
other.”

Transitional line. Relation of the oblong picture shape to the general visual
area. Examples: “The hills round off the corners.” “Transitional line.
Ravine.” “So look like transitional around. Circular top.”

Direction. Relation of objects by colors, values, or shapes leading from one
to the other. Examples: “These horses sort of point to the middle one.
Captain. Dominance.” “One pointing. The horse down to the ravine.”
“Yellow back there. See it first, then look at the horses.” “Blue in the corner.
Look at it, then notice them.”

Balance. Equilibration of elements in the picture space. Examples:
“Would have been too much on this side. The picture wouldn’t balance right.”
“The horses wanted balance—wanted hills.” “Top-heavy.”

Emphasis. Variation in stress to provide variation of interest-value.
Examples: “No. There would be just the hill there. There would be too
much hill.” “It makes the horse stand out too much.”

Domination. Recognition of one portion of the picture as being most important
but no recognition of the means employed to make it so. Examples:
“Horses. Supposed to be the main thing.” “Horses and boulders center-piece.
Boulders and others background.”

    Domination by isolation. Examples: “Only thing in there except
    ground. Attract attention more.” “Horses the main things. No more
    other things you want to look at.”

    Domination by size. Examples: “Horses. Main object. Bigger.”
    “No. Horse takes up more room. Not well if don’t have enough of what
    painting about.” “Got captain. Dominance. Dark red horse. Up higher.”

    Domination by bright color. Examples: “Color of horses makes them
    stand out.” “Brightest color.” “Background not so bright as other.”

    Domination by different color. Examples: “Mostly green. Have to
    make it blue to show up.” “If background the same color it wouldn’t
    show up. Different color.”

    Domination by different value. Examples: “Colors. The rest of the
    colors lighter. Horses darker.” “Dark green wouldn’t look right against
    dark red horses.” “Around the horses, colors are lighter than others.”

    Domination by position in the center. Examples: “That was his main
    topic, in the middle.” “Horses. Right in the center. See them better
    because first things seen.”

    Domination by position in the foreground. Examples: “Horses are
    more important in the picture so he put them up in front.” “Because they
    would show up better in front than in back.”

    Domination through movement. Examples: “It is the most moving
    creature.” “Attract attention. Prancing around.” “Action makes them
    show up.”

    Domination by framing. Examples: “They have a sort of border on
    both sides.” “Framing around the horses.” “Helps surround the picture.”


Depth. Realization of the presence of three-dimensional relationships.
Example: “Hills. Really look like back of each other.”

Rhythm. Movement through the relationships of pictorial elements.
Examples: “Look kind of rolly and fluffy. Like the waves. Flow like that.”
“Rhythm. Goes up horses’ backs.”

Color harmony. Colors considered in terms of mutual relationship and not 
in terms of preference for isolated colors. 


Color harmony through the use of values of the same color or through
the use of few colors. Examples: “If too many colors not look good.”
“All the horses are different shades of red.”

Color harmony through the use of colors which are not contrasting
colors. Examples: “Green next to red is not as good as blue.” “Wouldn’t
want blue and yellow, rather than blue and red.”

Color harmony through the use of colors containing a common hue.
Examples: “Colors go good together. Pink and reddish pink.” “Thought
red and purplish and blue went good together.” “Colors matching. All
have been darkened by a dark color.” “Yellow will divide into all the
horses.”

Color harmony. Responses indicating that it is not obtained by the use
of contrasting color. Examples: “If green it wouldn’t go good with the
horses.” “Red and green opposite. Together they would not look very
well.”


Variety of color. Preference for different colors, a dislike of too much of
one color, a desire for more of a color. Examples: “Better kinda yellow or
brown but then, if all the same color, it would be rather dull.” “The middle
horse is not the same color. You wouldn’t want the picture all the same color.”
“It makes it prettier. So many colors in it.”

Variety of color through contrast in color. Examples: “Lot of green and a
little red. Makes the horses show up.”

Harmony of value. Examples: “All sort of light so they go in good
together.” “Next to the horses. Because the horses are bright.” “If there
was a real dark background and the horses were real light it would look funny.”

Variety of value. Examples: “Not put the sun in too many light colors. A
little light and a little dark are better than all light.” “The hills bring my
attention. Some places are dark and others real light.”

Harmony of size. Examples: “They (horses) are all the same size.”
“The hills go better with the horses. The horses are big and high. The hills
are about level with the horses—to the top of the horses.”

Variety of size. Examples: “You wouldn’t want them all the same size.”
“The hills. They are not all the same size.”

Harmony of shape. Examples: “Harmony of shape. The horses are all the
same shape. The ground. The hills.” “If hills were squares and the horses
curved like—squares and curves don’t go together.” “Everything is like a
circle. Round. Rock. Hill. Horses.” “Yes. How the land is painted. Not
like real land. More like the way the horses are standing.”

Variety of shape. Examples: “You wouldn’t want them all standing the
same way. One has his head down, one has his head turned.” “Just makes it
look better.” “Not the same thing over and over like a cornfield.”

Mood through color. Use of color as a medium to express mood. Examples:
“Happy feeling. The way it’s colored. The colors they use in it. All the
colors are joyous colors—real bright.” “Could have mixed a little brown with
the horses, yet not so good, so joyful, that way.” “Gay and happy, full of pep.
Bright color.”

Mood through movement. Use of movement as a medium to express quality
of mood. Examples: “Probably wanted to show them peppy and raring to go.
The way they are looking and the way their feet are.” “Jumpy. Horses so
lively and everything. Want to turn somersaults. Looking different ways.
Running around.”

Mood through rhythm. Use of rhythm as a medium to express mood.
Examples: “Happy. Lines curved. When sad, straight.” “Foxy. Hills look
so good and bumpy.” “Hills look like flying—joyful.” “Looked stale. Has
rhythm up there in the hills. Makes it feel gay and happy.”

More inclusive harmonies. Relating of one type of element to another type.
Example: “The colors fit what they (horses) are doing.”


The child’s esthetic analysis score was derived by determining
the proportion of the total number of esthetic responses to the
total number of responses made by the child. The significance
of the difference between initial and final mean scores for each
group and of the difference between the final scores of the two
groups was obtained in the manner described on page 6. The
results are presented in Table VIII.
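
The scoring rule just described can be sketched in a few lines. This is only an illustration of the stated proportion; the function name and the sample counts are hypothetical and do not come from the study.

```python
def esthetic_analysis_score(esthetic_responses, total_responses):
    """Proportion of a child's responses that were classified as esthetic."""
    if total_responses <= 0:
        raise ValueError("the child must have made at least one response")
    if not 0 <= esthetic_responses <= total_responses:
        raise ValueError("esthetic responses cannot exceed total responses")
    return esthetic_responses / total_responses


# Hypothetical child: 6 esthetic responses out of 24 responses in all.
example_score = esthetic_analysis_score(6, 24)
print(example_score)
```

Because the score is a proportion, it can fall either when esthetic responses become fewer or when non-esthetic responses become more numerous.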

In Experiment III each group gained a significant amount 
between initial and final tests, and the difference between the 
final scores of the experimental and the control groups was sig- 
nificant. In Experiment IV each group gained a significant 
amount between initial and final tests but the difference between 
the final mean scores of the two groups was not significant. The 
initial score of the experimental group was much lower, however, 
than that of the control group. Accordingly the significance of 
the difference between the gains of the two groups was computed 
and is presented in Table IX.

The chances that the gain in Experiment IV was significant 
proved to be as great as those in Experiment III. These results 
indicate that training in the principles of art does definitely 
increase the child’s ability, not only to understand an analysis 
of a picture, but to apply correctly the same type of analysis 
to a new picture situation. 

VII. The effect of the experimental procedure upon the 
approach to the picture. The scores on the compositional 
analysis responses may be used to check the validity of the 
hypothesis that, accompanying an increase in esthetic judgment 
and analysis scores, there is a decrease in the number of responses 
indicating that the picture is considered as a substitute-object, 
and an increase in the number of responses indicating that the 
picture is considered as an esthetic organization. The significant 
increase in mean scores on the McAdory Art Test and on the 
esthetic analysis responses indicated an increase in these two 
abilities. The changes in mean score in certain categories of the 
esthetic analysis classification were investigated to determine if 
a change had actually occurred in the manner in which the child
set about his examination of the picture. The mean number 
of responses for each category for series one, two, and three, of 
the administration of the questions to Groups A and B is 
presented in Table X. 

The significant categories to examine in this respect are those 
of interest in subject-matter, enumeration, naturalism, logical 
response, and drawing. In Experiment III, following the experi- 
mental period, there is a decrease in each of these categories 
with the exception of drawing. The mean change in score when 
the scores for these five categories were combined was —.34. In 
Experiment IV, following the experimental period, the mean 
number of responses decreased in the categories, interest in 
subject-matter, enumeration, and drawing, and increased in the 
categories, naturalism and logical response. The change in mean 
score of these five categories combined, however, was —.16. 
Even greater than the decrease in mean scores was the decrease 
in the proportion of such responses to the total number of 
responses, since the decrease in non-esthetic responses was
accompanied by a large increase in the number of esthetic 
responses. 

Probably, due to the nature of the test situation, the esthetic
analysis responses continued to be based to a greater extent on
the type of responses characterized as substitute-object responses
than in the case of the esthetic judgment responses. In the
esthetic analysis test-situation, a type of subject-matter, of very 
great interest to children of this age and accordingly likely to 
arouse many associations from past experience, was portrayed 
in such a manner that photographic representation was flagrantly 
violated. In contradistinction, the esthetic judgment test con-
sisted of a large variety of types of subject-matter, many of
which are not inherently interesting to children of this age and 
was presented in such form that often alternative choices were 
equally acceptable photographic representations. Accordingly, 
although a direct inference from one situation to the other may 
not be valid, it does appear probable that in the esthetic judg- 
ment test as in the esthetic analysis responses fewer judgments 
TABLE X. Mean scores for Groups A and B on each type of esthetic
analysis response

(Columns: Group A, Test 1, Test 2, Test 3; Group B, Test 1, Test 2,
Test 3. The numerical entries for the categories below are not legible
in this copy.)

Category

Non-esthetic:
    Unclassifiable
    Incorrect
    Subject-matter
    Enumeration
    Naturalism
    Logical responses
    Drawing
    Object variety
    Generalities
    Stereotypes, color
    Preference for the unusual
    Personal associations
    Enjoyment color
    Perception movement
    Perception texture

Esthetic:
    Space filling
    Distribution in space
    Grouping
    Pattern of color
    Repetition
    Sequence
    Radiation
    Geometrical form
    Transitional line
    Directional line
    Balance
    Emphasis
    Domination, general
    Isolation
    Size
    Bright color
    Different color
    Contrasting color
    Different value
    Center position
    Foreground
    Movement
    Framing
    Depth
    Rhythm
    Color harmony
        Through use of values of the same color
        Through use of colors not contrasting
        Through use of colors containing a common color
        Not contrast
    Color variation
    Color variation through contrast
TABLE X (continued)

                                   Group A                  Group B
Category                     Test 1  Test 2  Test 3   Test 1  Test 2  Test 3
Harmony of value               .04     .28     .21      .02     .00     .00
Variety of value               .09     .10     .16      .08     .07     .06
Harmony of size and shape      .02     .00     .05      .00     .02   [illegible]
Variety of size and shape  [illegible] .48     .40      .16     .39     .30
Mood through color             .44    1.13    1.30      .38     .49     .63
Mood through movement      [entries illegible]
Mood through rhythm            .02 [illegible] .09      .06     .00     .09
Harmony of major qualities     .00     .08     .07      .00     .00     .04

were based upon such criteria as subject-interest, photographic 
representation, and technical ability, and more judgments were 
based upon the criteria of the principles of art following the 
training period than before, and that the increase in esthetic 
judgment score was accompanied by this change in criteria. As 
the emphasis in the experiment was placed upon the explanation 
of positive methods of organizing the picture and not upon the 
inadequacy of substitute-object criteria, the results seem to con- 
firm to a certain extent the assumption made at the beginning 
of this report that the two are mutually exclusive. 

VIII. The effect of the experimental conditions upon the 
development of the ability to produce an esthetically meritorious 
composition. The ability of the child to organize an original 
painting according to esthetic relationships was also tested before
and after Experiment III and after Experiment IV. The chil- 
dren were tested in groups of six or eight children at a time. 
Each child was provided with an easel, brush, and paper. A 
large supply of various paints was placed on a table for the use 
of all of the children. After the children had been assembled 
the following directions were given: 


We are going to paint a picture today. First, I will tell you how to use 
these paints. You dip your brush in a glass of water, and wipe it off carefully 
on the edge of the glass so that there is not too much water in the brush. If
there is too much water the paint will run down your picture. Then dip just 
the tip of the brush into the paint jar and start to paint. If the color is too 
dark, put more water on your brush. If the color is too light, do not put so 
much water on your brush. You may take one jar of paint at a time. As soon 
as you have finished with that color, bring it back to the table and take the 
next color of paint that you wish to use. 

Now let me tell you what I want you to paint. You may paint an illustration 
of some story you have been reading such as William Tell or Robinson Crusoe, 
or you may paint a picture about winter sports or some other subject that you 
like. I want you to put at least two objects in your picture, and to put in a 
background; that is, to put in hills, or trees, or water, or whatever you would
like for a background for your main objects. 

As we have only a short time to paint we will have to make a sketch. Have 
you made sketches in school? When you sketch anything, you paint in the main 
objects of your picture quickly and do not try to paint every little thing in the 
picture. Put in the main objects first, and then fill in the background. 

I will give you a minute or two to decide what you would like to paint. Then 
decide which color you will need and take a jar of that color of paint to your 
easel and begin. Remember to put in at least two objects first and then fill in 
the background. You will find all of the other things you need for painting at 
your easel. You will have fifteen minutes in which to paint your picture. 


All of the paintings, including three for each child, were 
judged at one time. The judges were ten research assistants in 
the Psychology of Art division at the University of Iowa. These
judges were artists, or were individuals possessing considerable 
experience in judging children’s paintings. They represented 
a wide variety of points of view and training. The following 
instructions were given for the ranking of the paintings. 


These paintings were made under the following conditions. The children
were allowed 15 min. in which to paint. They could paint any subject-matter 
they desired although illustrations of stories such as William Tell or Robinson 
Crusoe, or the subject “Winter Sports” were suggested. Two conditions must 
be fulfilled: (1) there must be at least two objects in the picture, and (2) there 
must be a background. They were prohibited from painting anything in the 
room. 

Due to the limitation in time and to the fact that water color was a new 
medium to the children, the paintings are necessarily very rough. Accordingly, 
it may be necessary to be quite generous in your judgments in order to differ- 
entiate the paintings sufficiently. Disregard skill in technique and representa- 
tion and base your judgments upon the relative attainment of composition, 
involving rhythm, balance, unity, harmony, etc. Any consideration of what a
fifth-grade child should do should likewise be disregarded and judgments based 
entirely upon the relative merit of the compositions within this group. 

You are to group the paintings into nine piles, representing nine degrees of 
merit from very poor through a mediocre performance to very good. Reserve 
the lowest pile for pictures entirely lacking in composition, the highest pile for 
pictures you consider to exhibit outstanding merit. It may be helpful to 
separate the pictures into three piles and later to sub-divide each of these piles 
into three degrees of merit. After you have completed your ranking, go 
through the pictures again and again, redistributing the pictures until you are 
relatively sure that further examination would not change the placements to 
any great extent. 

When you have completed the ranking process, tabulate the paintings within 
each pile by recording the number stamped on the back of the picture. The 
group representing the lowest degree of merit will be designated Group 1, that 
representing the highest degree of merit, Group 9. 

When the recording has been completed, shuffle all the pictures together so 
that no trace of your ranking remains. 










The final score for each painting was in the form of an
average of the ranking of all the judges. As there was no
suitable scale by which the paintings could be rated there were
distinct differences in the form of the distributions.13 In com-
bining the judgments, the scores for each judge were transmuted
into sigma values on a normal distribution and these values were
averaged to give the final score of a painting (15, pp. 155-160).
The reliability of these judgments, as measured by correlating
the average of five judges with the average of the remaining
five judges, was .62 ± .03. The reliability of the final score, as
determined by the average of the scores of the ten judges, was
estimated by the application of the Spearman-Brown prophecy
formula for double the number of judgments to the above
correlation. The resulting value was .77.
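
The two computations in this paragraph can be sketched as follows. The Spearman-Brown step is the standard formula, r_k = k r / (1 + (k - 1) r), applied with k = 2 to the reported half-panel correlation of .62. The conversion of piles to sigma values is an assumption made for illustration: each pile is placed at the midpoint of the share of paintings it occupies in a judge's own distribution. The exact procedure of the cited reference (15) is not reproduced here, and the pile data below are hypothetical.

```python
from collections import Counter
from statistics import NormalDist, mean


def piles_to_sigma(piles):
    """Convert one judge's pile numbers (1 = lowest, 9 = highest) to normal deviates.

    Assumption: each pile sits at the midpoint of the proportion of
    paintings that the judge placed in it.
    """
    counts = Counter(piles)
    total = len(piles)
    sigma_for_pile = {}
    below = 0  # number of paintings in lower piles
    for pile in sorted(counts):
        midpoint = (below + counts[pile] / 2) / total
        sigma_for_pile[pile] = NormalDist().inv_cdf(midpoint)
        below += counts[pile]
    return [sigma_for_pile[p] for p in piles]


def final_scores(judgments):
    """Average the sigma values of all judges, painting by painting."""
    per_judge = [piles_to_sigma(piles) for piles in judgments]
    return [mean(values) for values in zip(*per_judge)]


def spearman_brown(r, k=2):
    """Estimated reliability of a composite k times as long as the one giving r."""
    return k * r / (1 + (k - 1) * r)


# Hypothetical piles from two judges for the same five paintings.
judge_a = [1, 1, 2, 3, 3]
judge_b = [2, 1, 2, 3, 2]
scores = final_scores([judge_a, judge_b])

# Half-panel correlation reported in the text, stepped up to all ten judges.
stepped_up = spearman_brown(0.62)
print(round(stepped_up, 2))  # 0.77
```

Expressing each judge's piles on a common normal scale before averaging keeps a judge who used only the middle piles from being outweighed by one who spread the paintings widely.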

The significance of the difference between initial and final
scores of each group, and of the difference between the final 
scores of the two groups was determined as before. The results 
are presented in Table XI. These results fail to indicate that 
a significant difference occurred either between initial and final 
tests within each group or between the final tests of the X and 
C groups. When the significance of the difference between the 
gains, shown in Table XII, was computed the probabilities seem 
sufficiently high to indicate that the experimental procedure did 
produce a change in the ability of the child successfully to 
organize a composition. Apparently the experimental conditions 
were detrimental in their influence. However, it is probable that 
practice is fully as important in production ability as in esthetic
judgment ability. The experiments provided for a large amount
of practice in the recognition of the principles of art. It is
recognition that is involved in esthetic judgment and analysis.
There was, however, no opportunity provided in which the child 
might attempt to apply the principles in the composition of his 
own paintings. Without such opportunity for practice in appli- 
cation, it is possible that an abrupt attempt at such application 


13 Due to the sketchiness of the paintings the Iowa Drawing Scale could not 
be used, as the paintings all tended to fall into the lower classes so that sufficient 
discrimination could not be obtained. 
within the very short space of time allotted for the production
of the painting confused the child to such an extent that a 
decrease rather than an increase in the quality of the composition 
resulted. 

IX. The permanence of the increase in esthetic judgment 
score produced by Experiment II. As both groups of children 
received training in Experiments III and IV a retest some time 
after the conclusion of these experiments was not expedient in 
that there was no standard by which to compare their final per- 
formance. In Experiment I, however, the training of the A 
group had resembled so closely in nature and in results a simple 
repetition of the test, that a retest of the groups used in Experi- 
ment I and II was feasible. Accordingly a retest of both groups 
was made approximately six months following the conclusion 
of Experiment II to determine the stability of the increase in 
esthetic judgment produced by the experimental procedures. 

The results, presented in Table XIII, indicate that the differ- 
ence between the two groups had now decreased to 1.11 and was 
no longer significant. It appeared that the understanding as to 
the proper criteria for esthetic judgment and analysis produced
by the experimental techniques was not sufficient to continue in
operation during the intervening summer vacation period and the
activities of the subsequent school year, neither of which involved 
this type of training. On the other hand, the gain of the control 
group is probably that of the natural increment, that is, the 
increase in score from grade to grade characteristic of the 
McAdory Art Test. Although there are no data upon which to 
base any definite conclusions as to what actually occurred to 
bring about these results, the theoretical explanation is advanced 
that unless a situation is provided in which a sufficient oppor- 
tunity is furnished for continuous practice in the recognition of 
these principles, failure to progress to a significant degree in the 
ability to judge esthetically ensues.

X. Summary and conclusions. This investigation was designed 
to determine the effect of two factors, direction as to the proper 
criteria of art and practice in the perception of manifestations 
of these criteria, in the development of esthetic judgment,
esthetic analysis, and the production of an esthetically meri-
torious composition. The subjects were children of the second,
third, fourth, and fifth grades in two typical Iowa schools. These
subjects were divided into two matched groups on the basis of
grade, sex, IQ, MA, CA, and McAdory Art Test scores. The
effect of a knowledge of the principles of art and of practice in
the recognition of applications of these principles was tested by
a series of four experiments. In Experiments I and II the sub-
jects were tested before and after each experiment by a modified
form of the McAdory Art Test and by two subsidiary tests: an
arrangement of items from the Whorley Unity Scale, and the
Lewerenz Test in Fundamental Abilities of Visual Art, Part I,
Test 1. In Experiments III and IV the subjects were tested
before and after each experiment by the standardized form of
the McAdory Art Test, by a series of questions requiring an
esthetic analysis of Franz Marc’s painting Red Horses, and by
the production of an original composition.

In Experiments I and II the experimental groups were pre- 
sented with a series of paired or multiple-choice exercises, one 
member of which exemplified, the other of which violated, a 
principle of art. In Experiment I the subjects were required to 
judge upon the relative merit of the members of each exercise 
but were not provided with any explanation of the factors upon 
which the judgments should be based. In Experiment II the 
subjects were required to judge as to the relative merit of each 
portion of the exercise, as in Experiment I, but they were also 
directed as to the factor upon which the judgment should be 
based by instruction as to the principle of art which was involved. 
In Experiment I no significant difference between the gains of 
the experimental and control groups occurred, while in Experi- 
ment II a significant difference did occur. Variations in Experi- 
ment II indicated that the gain in esthetic judgment score was 
also dependent upon the factor of practice in the recognition of 
the principles, since a group having judged more than 90 exer- 
cises gained in esthetic judgment score to a greater extent than 
a group having judged fewer than 90 exercises. 
To determine whether increases in esthetic judgment score in
Experiment II were a function of knowledge of and practice in 
the recognition of the principles of art, and not a function of 
coaching upon exercises similar to the test exercises, a com- 
parison was made of the gains upon sub-tests involving different 
art elements. A significant increase in esthetic judgment score 
occurred upon portions of the test situation which dealt with 
applications of the principles to elements not similar to those 
involved in the experimental exercises. 

In Experiments III and IV the groups were presented with
the same explanations as to the principles of art involved in the 
exercises. In Experiment III the subjects were required to 
abstract a common principle from a number of illustrations which 
varied in many other respects. In Experiment IV the subjects 
were required to note the presence of a certain principle of art 
in one member of the exercise and the absence of this principle in 
the other member in order to judge as to their respective merits. 
In Experiment III, although the exercises differed considerably 
from the exercises involved in the test situation, the significance
of the difference in gain between experimental and control groups 
was approximately the same as that in Experiment IV in which
the exercises were more similar to the test exercises. Cultivation 
of a comprehension and recognition of the principles of art there- 
fore seems to bring about an increase in esthetic judgment 
regardless of the type of exercises which may be utilized to 
illustrate the principles. 

An analysis of the responses to the picture Red Horses demon- 
strated that, following the experimental period, the subjects pos- 
sessed a much greater understanding of what is involved in the 
analysis and judgment of esthetic merit than before, and that,
accompanying an increase in the number of responses indicating a 
comprehension of the picture as an organization of esthetic rela- 
tionships, the number of responses indicating a comprehension 
of the picture as a substitute-object decreased in number and in 
proportion to the total number of responses. 

Significance of the differences in mean ratings of the paintings 







produced before and after the experimental procedure did not 
indicate an increase in production ability similar to that secured 
in esthetic judgment and analysis abilities. There was appar-
ently a tendency for the experimental group to lose slightly in
mean score and for the control group to gain. Failure of the 
experimental procedure to provide opportunities for application 
of the principles to production may account for the result. For 
an adequate understanding of the effect of a knowledge of the 
principles of art upon production, a more refined experimental 
procedure, accompanied by opportunities for application, is 
needed. 

The following conclusions seem to be justified by the fore- 
going results: (1) The principles of art can be explained in such 
a manner that they can be understood and applied by children in
the second, third, fourth, and fifth grades. (2) An understand-
ing of the principles of art produces a significant increase in the
ability to judge between varying degrees of merit. (3) An
understanding of the principles of art produces a significant
increase in the ability to analyze the esthetic organization of a
picture. (4) Accompanying a significant increase in the esthetic
judgment and the esthetic analysis scores a decrease occurs in the
comprehension of the picture as a substitute-object and an
increase occurs in the comprehension of the picture as an esthetic
organization. (5) An explanation of a principle of art accom-
panied by illustrations exemplifying its application in many ways
brings about a greater increase in esthetic judgment than an
explanation accompanied by illustrations exemplifying its appli-
cation in but few ways. (6) An understanding of the principles
of art without opportunity for the application of these principles 
in the production of art is not sufficient to increase the ability of 
the child to organize an esthetically meritorious composition, as 
such ability was measured in this experiment. (7) With com- 
plete cessation of the type of activities provided by the experi- 
mental procedure over a considerable period of time, there is 
failure to progress to a sufficient extent so that the experimental 
group retains a significant superiority over the control group. 
EMPATHIC RESPONSES IN CHILDREN 1


by 


WILLIAM E. WALTON 


I. Introduction. Previous studies (6, 9, 12, 14) of empathy, 
as it is concerned with graphic art, have been made on adult 
subjects. This investigation was planned to explore the child
range from kindergarten through the twelfth grade to discover 
its genetic development and individual differences at these age 
levels. In order to interpret children’s responses adequately, 
responses were also secured from adults. It was believed that 
such a study might furnish additional information concerning 
(1) the nature of the empathic response, (2) the differences in 
response between artistic and non-artistic children, and (3) the 
relationship between the usual lines and colors and facial forms 
of expression.3

Empathy is expressed usually as a process of “ feeling oneself 
into” the perceptual object. Physiological responses, probably 
visceral and kinesthetic in nature, take place. However, a lack
of recognition of their physiological character causes identifi- 
cation of these empathic responses with the object. As Langfeld 
cites, “The perception obtained by the roundness of an object,
or other factors of shape, depends upon similar processes. When 
we notice the smooth curves of a marble torso, we can probably, 
if we observe carefully, get a fleeting image of our hands moving 
in imagination around the figure” (12). We speak of lines as

1 An investigation in the research program, Genetic Studies in Artistic
Capacity, directed by Dr. Norman C. Meier and sponsored by the Carnegie
Foundation for the Advancement of Teaching with financial aid from the
Carnegie Corporation of New York. Later sections of this study were com-
pleted at the University of Nebraska.


2 This investigation is related to earlier studies reported in Psychol. Monog.,
45, 1933, No. 1. 


3 The Ruckmick series of facial expressions were used in part for this portion
of the investigation. 
being ‘sad,’ ‘merry,’ ‘angry.’ Empathy is not to be confused
with associations which are purely mental. A child, for example, 
may not like red because the maid in the family wears the 
color (16). Empathy is closely related to mimetic responses 
which are responses made in imitation, either directly or through 
analogy, of some associated quality. ‘Loud’ colors, for example, 
are usually associated with noisy individuals, colors decorating 
noisy places and, in general, unrefinement and crudeness (1, 3, 4). 
It is believed that empathy is related to artistic ability in that the 
more artistically talented individual is the one who can more 
readily and completely identify himself with the object or situ- 
ation. Langfeld (13), Chandler (14), and Birkhoff (2) regard
empathy as a factor in esthetic responses, such as rhythm and 
balance. Because of the importance of empathic use of lines in 
cartooning and in the judgment of facial expressions, the writer 
also included a study of the judgment of facial expression. 

II. Technique: construction of tests; preliminary study. On
the assumption that test items used by previous experimenters 
upon adult subjects were too difficult for kindergarten children— 
as having the child draw a ‘merry line’—a simple dual choice
method was adopted instead in which the child would designate 
which line he thinks is ‘merry.’ Ninety-three dual choice situ- 
ations with accompanying questions were formulated. To insure 
comprehension at the first grade level, all words used in the ques- 
tions were checked against the Horn word-list.4 The situations 
included lines, forms, and colors and generally recognized prin- 
ciples of art presented in simple line drawings. Four situations 
are presented in Fig. 1. The child’s score was the number of 
responses made correctly. The correct choice was determined 
by the judgments of a group of colleagues in the field of the 
psychology of art. 

The test proved too easy, since the preliminary group was 
approximately 93 per cent correct in its responses, necessitating 
the use of more difficult materials and a technique which would 
call for a more precise use of the empathic factor. 


4 Horn, M. D. A study of the vocabulary of children before entering the 
first grade. Baltimore, International Kindergarten Union, 1928. 
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The final form of the test included four parts. The first called 
for the child’s response to adjectives by drawing lines descriptive 
of the adjective. In the second, test lines and colors were shown 
and the subjects were required to select from pairs of words the 
word having the same meaning as the presented line or color. 
Part three called for spontaneous verbalization in response to lines 
and colors. Test four employed the Ruckmick pictures and 
followed the technique of Gates (1). 

Fig. 1. Examples of items used in preliminary tests. 


Questions used with the items shown above: 
Which box do you think is the heavier? 
Which boy do you think is feeling good? 
In which house do you think the cross man lives? 
In which house do you think the happy girl lives? 




Construction of Test I. Test I consisted of 12 adjectives, 10 
of which were selected from Lundholm’s list. Lundholm’s complete 
list could not be used, as a comparison of his adjectives with 
Horn’s word list 5 showed that only 10 of the 48 were understood 
by children before entering the first grade. The two 
additional words, ‘tired’ and ‘happy,’ could have been used by 
Lundholm and in all probability would have been classed under 
groups III and IV respectively. Table I shows the list of 
adjectives used. 

The teacher introduced the experimenter, telling the children 
that he wanted them to take part in a little experiment or exer- 
cise. E then gave the following instructions: 


5 Horn, M. D., op. cit. 

If I were to ask you to draw a line, you might make one like this (E 
draws a straight line on the blackboard); others might make one like this 
(E draws a wavy line); still others might make lines like these (E draws a 
series of variously shaped lines). You see there are lots and lots of lines which 
can be drawn. (E erases all the lines from the blackboard.) Now I am going 
to give you some words and I want you to draw a line which you think means 
the same thing as the word. Think what the word means and then draw a line 
which means the same as the word. If you can’t be sure, guess. Now in 
space No. 1 I want you to draw a quiet line, a line which you think is quiet. 
(Pause, while line is being drawn; then:) There, that’s fine. Now, in the 
second space I want you to draw an angry line, a line which you think is 
angry (etc.). 

Care was taken not to build up the empathic state through 
emphasis upon the word. While there were few children in any 
grade who did not understand the meanings of the words, there 
were many who confused the words. The word ‘merry’ was 
frequently taken as ‘marry.’ Another word, ‘hard,’ caused 
trouble in that the children tended to think only in one way, i.e., 
if they thought ‘hard sidewalk’ they did not think ‘hard lessons,’ 
‘hard to get up,’ etc. Accordingly the word was always used in 
several sentences. 


TABLE I. List of adjectives used in Test I 


1. Quiet 5. Sad 9. Hard 
2. Angry 6. Dead 10. Strong 
3. Lazy 7. Playful 11. Weak 
4. Merry 8. Tired 12. Happy 


The order was arranged by chance. 


Mimeographed data sheets were provided on which were 
recorded age, sex, school, grade, etc. Test I required some 
limiting device such as the rectangular space on the data sheet 
since the size of the angles, length of curves, and degree of slant 
are dependent upon the size of the space in which they are drawn. 

Devising a scoring system for the line drawing test presented 
a number of difficulties. Against the procedure of scoring 
responses in terms of the group response were the objections: 
that the ‘ group’ would include many ages from kindergarten to 
college subjects and that the scores would be based to a certain 
extent upon the non-empathic responses of the lower grades. 
The best procedure seemed to be that of building a scale of values 
for each item based upon the work of Lundholm, Poffenberger, 
and Barrows. 
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A scale of seven steps was constructed varying from one con- 
forming to the greatest extent with previous investigators’ classi- 
fications to one representing the opposite feeling. Examples of 
the scoring scale for items 3 and 12 are shown in Fig. 2. 


[Figure: scoring scales for the items ‘Lazy’ and ‘Happy,’ arranged along a score axis.] 

Fig. 2. Examples of scoring scale. 


Construction of Test II. Test II reversed the dual choice 
technique employed in the preliminary tests which had proved 
to be too easy. Twelve lines were drawn and six colors placed 
upon white drawing paper 6 x 8 in. This was mounted in turn 
upon heavy gray cardboard so that the drawing was centered 
when it was projected by a Spencer delineascope on a Translux 
daylight screen. This projection enabled everyone in large 
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groups to see clearly the direction, form, and type of line. In 
smaller groups the cards were presented directly to the subjects. 
There was no difference in results between these methods of 
presentation. Each child was provided with a mimeographed 
list of paired words one of which was the correct response. The 
instructions for giving the test were as follows: 

Last week you drew some lines for me when I gave you some words. Today 
I am going to show you some lines which I have drawn and I shall also give you 
pairs of words for each item. I want you to place a circle around the word of 
each pair which you think means the same thing as the line. For example, the 
first two words are happy-sad; if you think the line is happy, put a circle around 
the word happy; if you think the line is sad, put a circle around sad. If you 
cannot be sure, guess. 


The test was scored as the number right. Line, color, and total 
scores were treated separately. The items and correct responses 
are shown in Plate I. 
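
The number-right scoring described above, with separate line, color, and total scores, can be sketched as follows. This is only an illustration: the key is copied from the answers listed in Plate I, the assumption is made that the six unnumbered answers on the plate belong to items 1 to 6 in the order printed, and the function name and response format are hypothetical.

```python
# Answer key as read from Plate I. Items 1-12 are lines, items 13-18 are colors.
# Assumption: the six unnumbered answers on the plate are items 1-6, in order.
ANSWER_KEY = {
    1: "happy", 2: "weak", 3: "merry", 4: "old", 5: "active", 6: "noisy",
    7: "lively", 8: "not angry", 9: "playful", 10: "not lazy",
    11: "not tired", 12: "hard",
    13: "warm", 14: "weak", 15: "heavy", 16: "happy", 17: "noisy",
    18: "not playful",
}
LINE_ITEMS = range(1, 13)
COLOR_ITEMS = range(13, 19)


def score_test_ii(responses):
    """Return (line score, color score, total) as the number of circled
    words that agree with the key. `responses` maps item number to word."""
    def right(items):
        return sum(
            1 for i in items
            if responses.get(i, "").strip().lower() == ANSWER_KEY[i]
        )
    line_score = right(LINE_ITEMS)
    color_score = right(COLOR_ITEMS)
    return line_score, color_score, line_score + color_score
```

A child who circles every keyed word would receive 12 on lines, 6 on colors, and the maximum total of 18, which agrees with the stated range of 0 to 18 for Test II.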

Construction of Test III. Test III was based on the assumption 
that when the subject is shown a line or color, the first words 
of a brief spontaneous report of his reactions would indicate the 
empathic response and its strength. In order to eliminate the 
imaginative response, an entirely different mental response from 
the feeling response which it is assumed appears first, the time 
interval for response must be extremely short. 

The following instructions were used in giving the test: 

The first time I was here, I gave you some words and you drew lines meaning 
the same as the words; the next time I showed you some lines and colors and 
you put circles around the correct word. Now we are going to try something 
new: I am going to show you some lines and I want you to tell me how you 
think you would feel if you felt the way the line feels. This may sound silly 
but let’s try it; just tell me how you think you would feel if you felt the way 
the line feels or tell me how you think the line feels. This is a little harder than 
the other tests, so do the very best you can; if you can’t be sure, guess. 


Scoring such a test is difficult and is open to the objections 
that the scorer may change his criterion or that his scoring is too 
subjective. The first difficulty was avoided by formulating a 
scale by which all papers would be scored. A frequency chart 
was made of the responses of 600 subjects. When these were 
studied and compared with the reports of other investigators of 
adult subjects, there was in general a close agreement between 
the highest frequencies and what has been accepted as the most 










WILLIAM E. WALTON 





PLATE I. Items employed in Test II. 

Correct answers (see text for explanation): 

 1. Happy       7. Lively       13. Warm [orange red] 
 2. Weak        8. Not Angry    14. Weak [blue green] 
 3. Merry       9. Playful      15. Heavy [violet blue] 
 4. Old        10. Not Lazy     16. Happy [orange yellow] 
 5. Active     11. Not Tired    17. Noisy [red violet] 
 6. Noisy      12. Hard         18. Not Playful [green yellow] 
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common empathic reaction to lines and colors. There seemed 
also to be a relationship between the frequency of the words or 
expressions and their suitability. Thus for a line to which the 
majority gave the word merry, the word good, given by a smaller 
group, would be less appropriate, while the words feeling bad, 
given by a very small group, would be inappropriate. This relationship 
of frequency to suitability was not always the rule, however, 
for there were many unique responses. These might correspond 
to adult reactions, as the response agitated rather than 
the usual one, angry. “The line feels like it’s going to have a 
new dress” is unique but is as correct as joy. 

The nine lines and five colors selected are shown in Plate II. 

After a complete analysis of the responses by three judges, a 
seven point scale was drawn up upon which each of the 75 or 
more responses given for each stimulus was classified. Upon 
this standard the performance of each subject was judged. 

Construction of Test IV. Children simulate the emotion being 
portrayed in a picture, especially in the simpler emotional states. 
Langfeld says, ‘The meaning of facial expression is learned 
from the movement of one’s own face, or a tendency toward such 
movement as suggested by the lines of the face observed.’ 6 It 
has been suggested that there can be no interpretation of facial 
expression unless the state is induced in the subject in a kind of 
implicit behavior. Inasmuch as the above involves empathy and 
as lines and surface shapes are present in the picture of a facial 
expression, it was thought that such a test might be related in 
some way to the other tests being employed. Those tests involved 
single lines and no surface shapes. In the last test the interpre- 
tation would be based possibly upon the configurational pattern 
of the lines and shapes. 

Ruckmick (13) prepared a set of 34 pictures expressing as 
many emotional states. Gates (6) made use of ten of the 
Ruckmick pictures 7 in testing the ability of 902 school children 
and 86 college students to interpret facial expression. The 
present investigation selected 20 of the Ruckmick pictures for 


6 Also found in H. S. Langfeld, The judgment of emotions from facial 
expressions, J. Abnorm. Psychol., 13, 1918, 172-184. 
7 These may be obtained from the C. H. Stoelting Co., Chicago. 
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Test IV, the more difficult ones, those which were quite similar 
in expression to others in the accepted group being omitted. Care 








PLATE II. Items employed in Test III. 

Best answers (see text for explanation): 

 1. Happy, glad, strong         8. Happy, merry, jolly 
 2. Strong, hard, heavy         9. Mad, angry, hot 
 3. Playful, merry, happy      10. Cool, light, weak [blue green] 
 4. Sad, weak, lazy, tired     11. Happy, merry, gay [orange yellow] 
 5. Strong, heavy, hard *      12. Sad, unhappy, weak [green yellow] 
 6. Happy, merry, playing      13. Happy, clean, cool [white] 
 7. Weak, thin, light          14. Warm, hot, happy [orange red] 

* Usually prefixed by the word ‘real’: ‘real’ strong, ‘real’ heavy, etc. 


was taken to avoid sequential presentation of two similar facial 
expressions. The instructions were similar to those employed 
by Gates, being adapted, however, to the child-level. 
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The instructions for giving the test were as follows: 


Today you are to take the fourth and last test. This one is different from 
the others. There are 20 pictures of an actress. I want you to look at each 
one and tell me what you think the lady is doing, what she is thinking about or 
how she is feeling. Answer any one of the questions, “What is the lady 
doing?” or “What is the lady thinking about?” or “How is the lady feeling?” 
Write as much as you care to in the space provided, one word or a whole 
sentence. If you can’t be sure, guess. [In the lower grades, i.e., up to and 
including the 3rd grade, the children were asked: “How do you think your 
mother would feel, or be thinking about, if she looked the way the lady looks? 
Now how do you think the lady is feeling, thinking about, etc.”] 


Test IV encountered the same scoring problem as Test III. 
It was assumed that there is more than one correct response for 
each facial expression and that the responses will range them- 
selves along a scale from good to bad. Frequencies were found 
for each response to each item, using the data from 500 subjects. 
The majority in each instance corresponded with Ruckmick’s 
list. On the basis of frequency and suitability, scoring scales were 
constructed. The word suitability is used in a special sense, in 
that it would be incorrect to depend upon frequency alone, as such 
a procedure would not allow for unique responses, such as 
accompany the pictures shown in Fig. 3. 

Reliability of the scoring system. The responses were referred 
to the key or scale in Tests II, III, and IV so that the only sub- 
jective error occurring would be in the scoring of the unique 
responses. It was necessary in these cases to understand the 
general meaning and then to find the word on the list best fitting 
the response. For example, the statement “ She looks like she 
has been home all day with nobody to talk to,” was set down as 
lonesome and scored accordingly. The original classification 
was kept as objective as possible by the utilization of the fre- 
quency charts, by reference to the work of previous investigators, 
and by employing three trained judges in the classification of 
each response on the scale of values. 

Test I involved greater subjectivity in judgments. Each 
response had to be matched to the scale of values for that item. 
As no two lines are exactly alike, interpretations must be made 
as to curvity, direction, character of irregularities, etc. Accordingly 
100 papers were scored by two competent judges. The 
correlation between their judgments was .92 ± .006, which would 
be considered satisfactory. 

A study was made of the comparative merits of the scoring 
used by Guilford and Guilford (4) in a test for students of design 
and the scoring system used in this investigation for Test I. 
Three factors were noted in their responses: (1) the form of 
the lines, as angle, curve, wave, or line; (2) the direction, as 
horizontal, up, and down; (3) the type, heavy, medium or light. 





Fig. 3. Children’s reactions to the above pictures involved such unique 
responses as: “She is thinking of God,” “She is looking at her father 
and she loves him,” “She is thinking of her sweetheart,” given for adoration; 
and “She is ‘high-hatting’ some one,” “She is acting stubborn (or 
saucy, smarty, haughty, snobbish, or determined),” given for defiance. 


After a statistical study of the responses of a large number of 
subjects, weightings were determined for each of the ten vari- 
ations possible with each adjective. Thus the word sad might 
produce a light line, extending downward. This would give 
values of two, four, and two, or a possible score of eight. 
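
The additive weighting just described can be illustrated with a short sketch. Only the three values quoted for the word sad (two, four, and two) come from the text; assigning them to form, direction, and type in that order is an assumption, and treating every other variation as zero is purely a placeholder, since the remaining weightings are not reported here. The names are hypothetical.

```python
# Hypothetical weight table for the adjective "sad".
# Only the values 2, 4, 2 appear in the text; their assignment to
# form / direction / type is assumed, and unlisted variations default to 0.
SAD_WEIGHTS = {
    "form": {"line": 2},        # one of: angle, curve, wave, line
    "direction": {"down": 4},   # one of: horizontal, up, down
    "type": {"light": 2},       # one of: heavy, medium, light
}


def weighted_score(weights, form, direction, line_type):
    """Sum the weights of the three factors noted for one drawn line."""
    return (
        weights["form"].get(form, 0)
        + weights["direction"].get(direction, 0)
        + weights["type"].get(line_type, 0)
    )


# A light line extending downward, drawn for "sad".
best_sad = weighted_score(SAD_WEIGHTS, "line", "down", "light")
```

Under these assumptions `best_sad` is 8, the possible score named in the text, and any departure on one factor lowers the sum by that factor's weight.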

When a comparison was made of the lines illustrating the 
highest possible score and the worst possible score using the 
Guilfords’ method and the Walton method, it was found that the 
two methods gave identical results. To find if differences 
occurred between these extremes a statistical study was made of 
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the eight identical words in the two tests. The correlations 
between the two methods of scoring are shown in Table II. In 
general, the agreement is good. 


TABLE II. Correlation between two methods of scoring 

Test Items     No. of Subjects    Correlation    P.E. 
Quiet                95              .713        .034 
Angry                95              .690        .039 
Lazy                 94              .546        .051 
Sad                  94              .647        .039 
Dead                 95              .433        .057 
Playful              95              .421        .057 
Hard                 95              .773        [illegible] 
Strong               93              .660        .039 
Total Score         151              .696        .027 
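
The P.E. column can be checked against the conventional probable-error formula for a correlation coefficient, P.E. = 0.6745 (1 − r²) / √N. That the tabled values were computed in exactly this way is an assumption, although the formula reproduces several entries closely. The function name is hypothetical.

```python
import math


def probable_error_r(r, n):
    """Classical probable error of a correlation coefficient:
    0.6745 * (1 - r**2) / sqrt(n)."""
    return 0.6745 * (1.0 - r * r) / math.sqrt(n)


# Entry for the item "Quiet": r = .713 with 95 subjects.
pe_quiet = probable_error_r(0.713, 95)

# Test-retest correlation reported in the text: r = .40 with 318 subjects.
pe_retest = probable_error_r(0.40, 318)
```

Here `pe_quiet` comes to about .034 and `pe_retest` to about .03, in agreement with the reported figures; a few other rows differ in the third decimal, which may reflect rounding or printing differences.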


A study was made of the systems as used in scoring the tests 
of artistic and non-artistic groups. The Walton system of scoring 
had revealed differences between artistic and non-artistic 
children in the lower grades, but it did not reveal such differences 
between the artistic and non-artistic groups at the junior high 
and university levels. 


TABLE III. Comparison of two systems of scoring * 

                       Artistic Group               Non-artistic Group        Chances in 
                                                                              100 of a 
Group      System   No.   Av.     σ      S.D.     No.   Av.     σ      S.D.   true diff. 
Children     G      34    48.67   .98    5.71     34    48.82   .94    5.47      54 
             W      34    45.88   1.27   7.41     34    48.53   1.46   8.54      91 
Adult †      G      39    52.64   .76    4.73     42    51.14   .64    4.15      93 
             W      [illegible]                   42    53.24   .94    6.11      89 

* [Footnote illegible.] 
† University Students. 

Table III shows that only the Guilford system approximates a 
significant difference in favor of the artistic group. The higher 
scores exhibited by the non-artistic children on the eight items 
when the Walton scoring was used are completely out of harmony 
with all other results obtained. 
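
The column headed “chances in 100 of a true difference” appears to follow the usual procedure of dividing the difference between two averages by the standard error of that difference and reading the result from the normal curve. The sketch below assumes that procedure, with the σ columns taken as standard errors of the averages; it reproduces the legible rows of Table III, but the exact method used is not stated in the text, and the function name is hypothetical.

```python
import math


def chances_in_100(av_a, se_a, av_b, se_b):
    """Chances in 100 that a difference between two averages is a true one.

    Assumes the normal approximation: z = |difference| / sigma_diff, with
    sigma_diff = sqrt(se_a**2 + se_b**2), converted by the normal CDF.
    """
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    z = abs(av_a - av_b) / se_diff
    return 100.0 * 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Rows of Table III (artistic vs. non-artistic groups).
children_g = chances_in_100(48.67, 0.98, 48.82, 0.94)
children_w = chances_in_100(45.88, 1.27, 48.53, 1.46)
adult_g = chances_in_100(52.64, 0.76, 51.14, 0.64)
```

Rounded to whole numbers these give 54, 91, and 93. Note that the measure is symmetric: it does not show which group is ahead, so the direction of each difference has to be read from the averages themselves.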

In spite of the superior diagnostic property of the Guilford 
system in the adult range, the Walton system was used in the 
scoring, for three reasons: the agreement between the two systems 
of scoring; the fact that the majority of children in the lower 
grades seem to differ somewhat from adults in the kind of lines 
drawn; and the fact that, since growth is characteristic of this 
ability, it is difficult to establish an age basis for weighting. 

Reliability of the tests. The results of repeating the tests are 
summarized in Tables IV and V and in Fig. 4. Repetition of 
the tests increased the average test score. 

In individual cases lower scores usually were the result of 
‘intellectualizing ’ the response in an effort to improve test per- 
formance instead of continued dependence upon feelings. 


[Figure: paired bars for Tests I, II, III, and IV on a score axis running from 50 to 100.] 

Fig. 4. Change in test score. The shaded line indicates the first trial, the 
solid line, the second. 


TABLE IV. Showing results of repeating the tests 

                        Trial 1                    Trial 2 
Test   Number    Av.     σ      S.D.        Av.     σ      S.D. 
I        59     71.89   1.43   11.05       81.55    .87    6.70 
II       59     16.53    .20    1.58       17.18   1.38    1.06 
III      62     80.35   1.27   10.00       86.89   1.05    8.30 
IV       65     80.56    .93    7.52       82.23    .94    7.45 
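
The σ column in these tables behaves like the standard error of the average, S.D. / √N. This reading is an assumption, since the text does not define the column, but it can be checked against the Trial 1 entries with a short sketch (hypothetical function name).

```python
import math


def sigma_of_average(sd, n):
    """Standard error of an average: S.D. divided by the square root of N."""
    return sd / math.sqrt(n)


# Trial 1 entries of Table IV.
sigma_test_i = sigma_of_average(11.05, 59)    # tabled value 1.43
sigma_test_iv = sigma_of_average(7.52, 65)    # tabled value .93
```

Both results agree with the tabled values to within about .01. An entry that departs widely from S.D. / √N under this reading is therefore a candidate for a printing or transcription slip rather than a real effect.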


The correlation between initial and re-test scores of 318 subjects 
in the Cedar Rapids schools was .40 ± .03. Failure to 
secure a higher correlation may be due to the fact that variations 
may occur in the reports of subjects which are due, not to the 
way the test is constructed or scored, but rather to the variation 
of the subject’s feeling state as shown by Langfeld (11) and 
Ruckmick (13). 

Curves of distribution are shown in Fig. 5, with the average 
and P.E. for each curve. The curve for Test II is skewed to 
the left, a statistical treatment of the data giving the skewness 
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as —.55. The other curves give the following measures of 
skewness: Test I=.05, Test III=—.34, Test IV=.15. 


TABLE V. Reliability coefficients of correlation 

              Repeated                Split-half Method          Whole Test 
Test     N      r      P.E.        N      r      P.E.          r      P.E. 
I       318    .40     .03         -      -       -            -       - 
I        59    .26     .08        99   [illeg.]  .06          .48     .05 
II       59    .43     .07        96     .31     .06          .47     .05 
III      62  [illeg.]  .07        98     .59     .04          .74     .03 
IV       65    .70     .04        97     .29     .06          .43     .05 
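
The whole-test coefficients are consistent with stepping up the split-half correlations by the Spearman-Brown formula for a test of doubled length, r_whole = 2r / (1 + r). The text does not name the formula, so its use here is an assumption; the function name is hypothetical.

```python
def spearman_brown(r_half):
    """Spearman-Brown estimate of whole-test reliability from a split-half r."""
    return 2.0 * r_half / (1.0 + r_half)


# Split-half coefficients for Tests III and II as tabled.
whole_test_iii = spearman_brown(0.59)
whole_test_ii = spearman_brown(0.31)
```

These come to about .74 and .47, matching the whole-test column for Tests III and II. The Test IV entry (.29 giving .43) is slightly lower than the formula's value of about .45, so the correspondence should be treated as approximate.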
[Figure: distribution curves for Tests I, II, III, and IV. Legend, giving the average and P.E. of the average for each curve: Test I, 56.5, .56; Test II, 12.0, .12; Test III, 67.4, .70; Test IV, 54.9, .53.] 
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Fig. 5. Distribution of scores 


III. Technique: factors investigated; appearance of the 
empathic response. Children in preschool laboratories of the 
University of Iowa and the kindergarten of Harrison School, 
Cedar Rapids, Iowa, were tested to discover the first appearance 
of the empathic response and its nature. Tests II to IV were 
given individually. 

Growth in capacity for empathic response. The following 
groups were tested: grade school children from kindergarten up 
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to VIA in Harrison School; Junior High Schools of Lincoln, 
Nebraska, and Des Moines, Iowa 8; several hundred university 
students. 

Artistic ability and empathic responses. Groups for the 
studies of artistic ability and empathic response were selected 
from four schools: first, the ‘X’ and ‘Z’ subjects 9 from Iowa 
City, Iowa; two groups of children, one artistic and the other 


[Figure: test score plotted against age.] 

Fig. 6. Relationship between age and score. 


non-artistic from the Cedar Rapids schools; two ‘paired’ groups 
of children representing the most and least artistic of 3,000 
Lincoln school children; and fine arts and college students. This 
gave us an opportunity to study four artistic groups and four 
non-artistic groups of equal mental ability but different ages. 

Other factors studied. A number of our subjects had been 
given a color harmony test 10 during some previous experiments. 
The scores on this test were compared with those of Test I and 


8 A study made by Dr. Meier. 

9 See C. Tiebout, The psychophysical functions differentiating artistically 
superior from artistically inferior children, Psychol. Monog., 45, 1933, No. 1, 
108-133. 

10 See p. 64, footnote 13. 
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Part 2 of Test II to determine whether or not similar factors 
were involved. School grades and intelligence were compared 
also with empathic scores. Finally relationships between the 
tests and analyses of the responses to each item by all subjects 
and artistic and non-artistic groups were studied. 

IV. Results. Fig. 6 shows the relationship between age and 
test score. 

Genetic development of empathy. Correlations between age 
and empathic response are shown in Table VI. 


TABLE VI. Showing correlation with age 


Test N r P.E. 
I 486 .466 .025 
II 473 .407 .026 
III 431 .325 .029 
IV 461 .416 .026 


Table VII shows the average, σ, and S.D. for each age-group. 
Those from 6 to 13 are from Harrison School, twelve to fourteen 


TABLE VII. Showing average scores at various age levels 

                      Test I                            Test II * 
Age-Group    N     Av.     σ av.    S.D.        N     Av.     σ av.     S.D. 
6            49    47.50   1.60     8.70        47     9.68    .36      2.44 
7            59    48.25   1.23    12.40        57    10.48   [illeg.] [illeg.] 
8            83    54.15   1.23    11.25        82    11.24   [illeg.]  2.30 
9            54    56.20   1.52    11.15        56    12.01    .34     [illeg.] 
10           98    62.60   1.04    10.25        91    13.02    .20      1.91 
11           75    59.90   1.14     9.85        77    13.03    .24      2.10 
12           41    62.25   1.38     8.85        41    12.73    .38      2.37 
13           16    64.35   2.92    11.65        17    12.65    .51      2.11 
12-14 †      68    68.90   1.29    10.63        66    15.23   1.80      1.47 
Adult        72    77.36   1.88     9.42        66    15.96    .16      1.33 

                      Test III                          Test IV 
Age-Group    N     Av.     σ av.    S.D.        N     Av.     σ av.     S.D. 
6            38    57.00   2.91    17.95        48    44.90   1.81     12.50 
7            38    60.40   2.59    16.00        54    52.10   1.41     10.60 
8            74    63.30   1.75    15.05        78    52.85   1.11      9.85 
9            52    65.30   2.01    14.50        52    53.85   1.28      9.30 
10           86    72.80   1.46    13.65        83    58.25   1.14     10.35 
11           75    75.85   1.38    12.95        74    61.70   1.39     11.95 
12           41    71.90   2.12    13.55        40    59.35   2.07     13.10 
13           17    67.80   4.04    16.65        16    55.60   2.45      9.80 
12-14 †      65    81.73   1.09     8.82        66    64.23   1.32     10.74 
Adult        72    82.32  [illeg.] 10.45        68    79.57    .96      7.94 

* Range on Test II is 0-18. 
† Lincoln Junior High School pupils. Compare Table VIII for results from another 
group. 
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from the Lincoln Public Schools, and the adult group from the 
University of Nebraska. 

In a second series of experiments the study was carried into 
the upper age groups in the Junior High Schools of Lincoln and 
given to a second University of Nebraska group. These results 
are shown in Table VIII. 


TABLE VIII. Showing averages of junior high school * and adults for all tests 

                      Test I                            Test II 
Age-Group    N     Av.     σ        S.D.        N     Av.     σ         S.D. 
13           53    70.35   1.46    10.69        56    13.93    .18      1.34 
14           55    69.45   1.09     8.08        54    14.67    .29     [illeg.] 
15           60    70.65   1.22     9.47        61    15.03    .20      1.55 
16           21    70.22   1.49     6.86        17    16.12    .25      1.02 
Adults       60    74.20   1.40    10.86        59    16.15   [illeg.]  1.31 

                      Test III                          Test IV 
Age-Group    N     Av.     σ        S.D.        N     Av.     σ         S.D. 
13           54    80.13   1.45    10.69        50    65.48   1.77     12.48 
14           56    83.13   1.41     8.91        59    66.18   1.13      8.67 
15           61    81.57   1.31    10.33        56    66.75   1.55     11.57 
16           20    82.80   1.56     6.96        17    66.96   2.34      9.67 
Adults       63    80.99   1.96    15.53        63    78.32   1.03      8.18 

* Lincoln, Nebraska, schools. 


The test was also given to groups of children in the Des 
Moines Public Schools, Des Moines, Iowa.11 These results are 
shown in Table IX. 


TABLE IX. Showing averages: Des Moines tests 

                      Test I                            Test II 
Age-Group    N     Av.     σ        S.D.        N     Av.     σ         S.D. 
10           60    60.17   1.29     9.99        60    14.20    .20      1.56 
11           36    61.83   1.48     8.82        36     3.81   [illeg.]  1.91 
12            7    56.21   2.55     6.75         7    12.57    .90      2.38 

                      Test III                          Test IV 
Age-Group    N     Av.     σ        S.D.        N     Av.     σ         S.D. 
10           58    75.34   1.61    12.25        57    57.29   1.42     11.46 
11           35    72.79   1.81     9.75        36    56.22   1.90     11.42 
12            5    53.50   7.63    17.06         6    49.07   2.48      6.07 


The significant correlation between age and test score supports 
the theory that a growth factor is involved in the empathic 


11 Given by Dr. Meier. 
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response. At upper levels there is in general a leveling off in age- 
groups, a possible exception occurring in Test II. 

Constancy of response. Table IV showed that the average 
score was raised upon repetition of the tests. A third test, however, 
failed to raise the scores when the σ was taken into account. 

Analysis of responses. A study of the responses revealed 
many individual differences. While one kindergarten child, 
known for his “great promise in artistic work,” made scores 
equal to those of adults, many adult subjects made scores no 
better than those of children. The curves of each age-group 
would spread themselves over almost the entire range, being 
skewed to the right or left according to the age-group. There 
were no sex differences. The unique responses previously mentioned 
under scoring methods were noted. In Test III many 
children interpreted the lines as doing and saying things, i.e., this 
line had fallen down and hurt itself, etc. It is an easy transfer 
from the empathic to the imaginative element; the latter had to 
be used to infer the presence of the former. 

One interesting feature of Tests III and IV was the dual 
aspect of the empathic response given by many children in the 
lower grades. The children’s classification of the item, as 
pleasant or unpleasant, agreed with that of the adults, a finding 
in support of certain genetic theories of the emotions. The first 
rough classification involved such word-pairs as happy-sad, 
happy-sick, good-bad, or glad-sad. Older children introduced 
more words classifying all lines as happy-sad, mad-afraid, good- 
bad-sick, or glad-sad-scared-mad. To these three and four 
dimensional classifications other words were added so that by 
the fourth grade, finer differences were being made between the 
reactions and the rough two-dimensional classification had disappeared. 
There were many instances, especially in the lower 
grades, when a stimulus failed to arouse a response. This happened 
with some of our ‘best’ subjects, i.e., subjects responding 
readily and giving evidence of a real empathic nature. 

Test IV elicited the assumption of facial responses and bodily 
attitudes by many of the subjects. When pictures depicting fear 
or surprise were shown there was a noticeable lifting of the 
eyelids, an exposure of the whites of the eyes. The anger, scorn, 
and mistrust pictures usually resulted in a knitting of the eyebrows, 
a stern appearance of the face, and the slightest tensing 
of the muscles. A relaxation of the facial muscles followed the 
showing of the pleasant pictures: joyousness, adoration, interested 
attention, and pleasant surprise. The reactions were immediate 
and did not last very long. 

Statistical study of responses to each item. The average 
score, its sigma and S.D., are given for each item of the four 
tests in Table X. 


TABLE X. Showing results of analysis of all items and all tests 

        Test I (N=524)              Test II (N=526)      Test III (N=475)            Test IV (N=476) 
Item    Av.      σ av.    S.D.      % Right  % Wrong     Av.      σ av.    S.D.      Av.     σ av. 
1       6.42     .32      2.75        69       31        4.40     .13      2.93      3.12    .04 
2       5.24     .10      2.25        68       32        4.76     .11      2.42      3.40    .06 
3       4.03     .12      2.79        77       23        4.68     .13      2.90      3.17    .06 
4       4.42    [illeg.] [illeg.]     84       16       [illeg.] [illeg.] [illeg.]   2.56    .05 
5      [illeg.] [illeg.] [illeg.]     82       18        4.81     .11      2.36      2.43    .04 
6       4.47     .13      2.87        76       24        5.28     .13      2.85      1.66    .08 
7       4.65     .10      2.24        85       15        4.60     .11      2.57      2.60    .07 
8      [illeg.] [illeg.] [illeg.]     30       70        4.50     .13      2.80      4.76    .04 
9       4.90     .11      2.56        84       16       [illeg.]  .13      2.75      1.66    .09 
10      4.45     .11      2.59        54       46       [illeg.] [illeg.] [illeg.]   3.62    .09 
11      3.98     .11      2.51        43       57        4.19     .13      2.89      2.48    .08 
12      4.00     .10      2.38        79       21        5.07    [illeg.] [illeg.]   2.80    .07 
13                                    84       16        4.87     .12      2.63      2.11    .07 
14                                    58       42       [illeg.] [illeg.] [illeg.]   1.34    .06 
15                                    31       69                                    3.10    .06 
16                                    75       25                                    1.94    .10 
17                                    31       69                                    3.81    .08 
18                                    60       40                                    1.84    .07 
19                                                                                   3.98    .07 
20                                                                                   1.68    .08 

Range of item scores: 0-8.3 [test label illegible]; Test IV, 0-5.0. 


Items 1 and 2, lines drawn in response to the words ‘quiet’ 
and ‘angry,’ seem too easy. Item 11 seemed low inasmuch as its 
direct opposite preceded it. Test II seemed much too easy for 
diagnostic purposes. This may have been due to the fact that 
it was a dual choice rather than a multiple choice situation. 
Test III showed high but not excessively high averages. Test 
IV showed six low mean scores, six which were average and 
eight which were high, one excessively so. This item is the 
picture entitled, joyousness, designated by most children as 
happy. It seemed to have been very commonly understood and 
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was consequently not diagnostic. Anger was much less com- 
monly understood. Some of the items in the test were too 
difficult. 

Artistic ability and empathic response. The artist may create 
by using lines which he has been taught will create a certain 
impression or he may use lines dictated by his own feelings. Two 
schools of pedagogical methodology have arisen concerning the 
desirability of these two alternatives. Children are better sub- 
jects than adults for the acquisition of information concerning 
this question, as direct or indirect training has had less opportunity 
to influence their behavior. Accordingly, the ‘X’ and 
‘Z’ subjects used in previously reported experiments 12 were 
compared as to performance on the tests. There were only seven 
subjects available of the artistic group, six of whom took all four 
tests. Of the nine non-artistic children, eight took the four 
tests. The results are shown in Table XI. 


TABLE XI. Comparison of X and Z subjects * 

                         Artistic Group                      Non-artistic Group                  Chances in 
                                                                                                 100 of a true 
Comparison          N    Av.       σ        S.D.        N         Av.       σ        S.D.        difference 
Test I              7   [illeg.]   2.06     5.42        9         64.00     3.17     9.52            65 
Test II, Pt. 1      7    8.78       .22      .65        9          9.42      .68     1.80            76 
Test II, Pt. 2      7    4.57       .45     1.18       [illeg.]    4.44    [illeg.] [illeg.]         58 
Test II             7   14.00       .95     2.51       [illeg.]  [illeg.]  [illeg.] [illeg.]         73 
Test III, Pt. 1     6   48.00      2.70     7.64        8         45.63     3.86     9.45            69 
Test III, Pt. 2     6   [illeg.]  [illeg.]  5.82        8         24.13     3.02     8.54            92 
Test III            6   75.83      5.94    14.55        8         69.38     3.12     8.82            83 
Test IV             6   57.50      5.98    14.62        8         60.63     3.24     9.16            67 

* See footnote 9. 


With the exception of one part of one test the differences are 
all in favor of the artistic children. The differences are not very 
significant, however, only one approaching the point of relia- 
bility. Probably the excellence of the school training which 
these children receive tends to overshadow certain natural tend- 
encies and to make the responses to these tests more uniform. 
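
The last column of Table XI can be approximated by a short computation. The sketch below assumes the usual critical-ratio procedure of the period: the difference between two means is divided by the standard error of that difference, and the ratio is referred to the normal curve. The method itself is given in footnote 9, which is not reproduced here, so the formula and the function name `chances_in_100` are assumptions rather than the author's stated procedure.

```python
import math


def chances_in_100(mean_a, se_a, mean_b, se_b):
    """Chances in 100 that the true difference has the observed sign.

    Assumes independent groups and a normal sampling distribution.
    se_a and se_b are the standard errors of the two means (the sigma columns).
    """
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)       # standard error of the difference
    critical_ratio = abs(mean_a - mean_b) / se_diff  # difference in standard-error units
    # Area of the normal curve below the critical ratio.
    proportion = 0.5 * (1.0 + math.erf(critical_ratio / math.sqrt(2.0)))
    return 100.0 * proportion


# Test III row of Table XI: artistic 75.83 (sigma 5.94), non-artistic 69.38 (sigma 3.12).
print(round(chances_in_100(75.83, 5.94, 69.38, 3.12), 1))
```

Under these assumptions the Test III row comes out close to the tabled value of 83, which suggests, but does not prove, that a computation of this kind lies behind the column.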

Twenty-six children, considered by the art instructor to be
most talented, and 20 children, considered least talented in her
classes of 300 children in the Harrison School, were compared
as to their ability on the tests. Intelligence test-scores were not
available and testing was not permitted. Assuming, however,
that general scholarship correlates with intelligence, a comparison
of general scholastic averages of the two groups was made,
which indicated that the two groups were equal scholastically.
Differences in empathic scores were all in favor of the artistic
children and, with the exception of Test III, were significant.
Table XII summarizes the results of these comparisons.


12 See footnote 9.


TABLE XII. Comparison of artistic and non-artistic children selected from 300 subjects *

                     Artistic Group                      Non-artistic Group             Chances in 100
Comparison        N    Av.      σ        S.D.       N          Av.        σ          S.D.    of a true difference
Scholarship      24    76.80    1.19     5.82       [illeg.]   [illeg.]   [illeg.]   5.21         64.0
Test I           26    67.70    2.50     6.70       20         58.00      2.57       11.50        99.7
Test II          26    13.66    .40      2.04       [illeg.]   12.80      .55        2.45         89.2
Test III         26    73.45    3.07     15.65      [illeg.]   [illeg.]   3.35       14.95        79.1
Test IV          26    64.40    2.14     10.90      20         37.50      2.75       12.32        97.6

* Harrison School, Cedar Rapids, Iowa.


To secure children of a higher age group, art instructors of the
Lincoln Junior High Schools selected the best and poorest children
from an artistic point of view, matching the two groups for
intelligence as far as possible. Thirty-four children took Test I
and thirty-three Tests II, III and IV. These results are shown
in Table XIII. The differences were significant in only three
instances. All but two of the eight measures, however, showed
differences favoring the artistic children. It may be that by the
time school children reach this age the major differences have
been eliminated through their art training, perhaps indirectly by
the setting up of favorable conditions in which there is use of
expressions such as ‘angry’ line, ‘happy’ line, etc.

The tests were also administered to two adult groups, one
selected by an instructor of Fine Arts, the other constituting an
Arts and Science group. The results, shown also in Table XIII,
show differences in favor of the artistic group, with the exception
of Test I. There is also uniformity between the groups on
Part 2 of Test II. In the other six instances our results show a
significant difference in favor of the artistic group.


TABLE XIII. Comparison of artistic and non-artistic groups

                            No.             Artistic Group                 Non-artistic Group          Chances in 100
Average on              Art.  Non-art.   Av.       σ av.     S.D.       Av.       σ av.     S.D.     of a true difference

[Junior high school groups]
Test I                   34     34       67.94     1.64      9.58       69.41     1.98      11.57         65
Pt. 1, Test II ¹         33     33       [illeg.]  [illeg.]  [illeg.]   [illeg.]  [illeg.]  [illeg.]      62
Pt. 2, Test II ²         33     33       [illeg.]  [illeg.]  [illeg.]   5.09      .08       .43           100
Comb. Test II            33     33       [illeg.]  [illeg.]  [illeg.]   14.97     [illeg.]  [illeg.]      94
Pt. 1, Test III          33     33       54.02     1.2       7.23       51.24     1.44      8.26          93
Pt. 2, Test III ³        33     33       29.73     .85       4.90       [illeg.]  .44       4.23          56
Comb. Test III           33     33       84.02     1.69      9.73       81.32     1.78      10.23         86
Test IV                  33     33       70.99     1.54      8.83       68.56     2.13      12.23         82

[Adult groups]
Test I ⁴                 34     72       76.09     1.07      6.70       76.81     1.11      9.40          68
Pt. 1, Test II           36     66       [illeg.]  [illeg.]  [illeg.]   [illeg.]  [illeg.]  [illeg.]      99
Pt. 2, Test II           36     66       [illeg.]  [illeg.]  [illeg.]   5.46      .08       .68           62
Comb. Test II            36     66       [illeg.]  [illeg.]  [illeg.]   15.96     [illeg.]  [illeg.]      99
Pt. 1, Test III          35     72       55.59     1.60      6.27       51.29     1.63      2.74          99
Pt. 2, Test III          35     72       32.71     .40       2.40       30.14     .56       4.70          100
Comb. Test III           35     72       88.22     1.21      7.19       81.60     1.23      10.45         100
Test IV                  36     68       84.33     .91       5.49       [illeg.]  [illeg.]  [illeg.]      100

1 Range: 0-12.
2 Range: 0-60.
3 Range: 0-41.5.
4 See Table.


All of the items used in the tests were analyzed further to learn
which were more diagnostic of artistic ability. Tables XIV-XVII
show the results. These tables also reveal those items
which were diagnostic in the lower grades but not in the upper,
those diagnostic in the upper grades but not in the lower, and
those diagnostic in both. Items 4, 6, and 8 of Test I, items 1,
7-10, 12-16, and 18 of Test II (in fact the whole test), items 2,
4, 8, 10-12 of Test III, and items 2, 5, 9, 15, 17, and 19 of
Test IV could be eliminated from the tests and all of the remaining
items combined into a single test.


TABLE XIV. Showing results of an analysis of all items, Test I, for the good in art and the poor in art
[Table body not recovered; column groups: Grade School Children, Junior High School.]

1-6 See footnotes, Table XIV.

TABLE XVII. Showing results of an analysis of all items, Test IV, for the good
[Caption truncated; table body not recovered.]


Relationships between tests. To determine if the tests actually
succeeded in measuring the same thing, coefficients of correlation
between the various tests were computed. These relationships
are presented in Table XVIII.


TABLE XVIII. Correlations between various tests

Comparison of        N       r       P.E.
Test I-II           480     .328     .028
Test I-III          440     .315     .028

Test I seems to bear less of a relationship to the other three 
tests than they do to each other. In general the correlations are 
sufficiently high to warrant the assumption that to a fairly high 
degree they are measuring the same thing. 

Relationship to other factors. Since a larger number of the
children of Harrison school had been given a color harmony
test,¹³ the correlation was found between this test and Test I.
The result, shown in Table XIX, indicates no relationship whatever.
To determine if the color reactions were based on a sense
of fitness rather than on a feeling for the color, the color harmony
scores were correlated with the results of Part 2 of Test II,
yielding the result shown in Table XIX. Inasmuch as there was
little relationship between single color preferences and sensitivity
to color harmonies,¹⁴ it would seem that the various color experiences
are quite distinct.


TABLE XIX. Showing correlations with scores on Color Harmony Test

Comparison of               N       r       P.E.
C H - Test I               194     .163     .046
C H - Pt. 2, Test II       223     .058     .045


Correlations between grades made in school and Test I and
between grades in art and Test I indicated no significant rela- 
tionship. These are presented in Table XX. There were no 
significant relationships to sex and to intelligence. 
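
The P.E. columns of the correlation tables can be checked with a few lines of arithmetic. The sketch below assumes the conventional probable-error formula for a product-moment coefficient, P.E. = 0.6745(1 - r²)/√N, together with the rule of thumb of the period that r should be roughly four times its P.E. before it is taken seriously. Neither the formula nor the criterion is stated in the text, so both are assumptions, and the function names are hypothetical.

```python
import math


def probable_error_of_r(r, n):
    """Probable error of a correlation coefficient r based on n cases (assumed formula)."""
    return 0.6745 * (1.0 - r ** 2) / math.sqrt(n)


def ratio_to_probable_error(r, n):
    """How many probable errors the coefficient lies from zero."""
    return abs(r) / probable_error_of_r(r, n)


# Color harmony scores against Part 2 of Test II: r = .058, N = 223.
pe = probable_error_of_r(0.058, 223)
print(round(pe, 3), round(ratio_to_probable_error(0.058, 223), 2))
```

For this comparison the computed probable error agrees with the tabled .045, and the coefficient is barely more than one probable error from zero, consistent with the statement that no relationship was found.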

Genetic development of empathy. In addition to the statistical
results of the investigation presented heretofore, observation of
the performance of children of many ages in the test situation
led the author to suggest the following theory of the genetic
development of empathy.


TABLE XX. Showing correlations between grades in school and Test I

Comparison of                             N           r       P.E.
Test I - Grade in Art                    225         .094     .045
  (lower grades, I B-IV B incl.)
Test I - Grade in Art                    [illeg.]    .093     .047
  (upper grades, IV A-VI A incl.)
Test I - All Grades                      436         .098     .037
  (upper and lower)


13 Walton, W. Sensitivity of children and adults to color harmony, Psychol.
Monog., 45, 1933, No. 1, 51-62.
14 Ibid., p. 60.

Results from the preliminary tests given to pre-school chil- 
dren suggest that very young children feel themselves into their 
entire environment. It may be that, beginning soon after birth, 
children develop a process of abstracting themselves from their 
environment, feeling themselves less and less into their surround- 
ings. The empathic tests tap these more primitive methods of 
reacting which, although they undoubtedly play some part in 
adult daily behavior, are more subtle in their influence. It is 
a common observation that very young children are ‘ bubbling 
over’ with ways in which the aeroplane, automobile, train or 
tractor ‘go.’ Voice, arms, legs, and indeed the whole body 
are used to describe one of these vehicles. Even the mention 
of the name is sufficient to start these physical processes. In 
adults abstraction has taken place so that they no longer feel 
themselves into the situation to such an extent, but in all prob- 
ability the name of an object will arouse incipient movements. 
To a lesser extent this is true of adjectives. 

Children do not, however, have very well established commu- 
nities of ideas. This would account for the low scores of very 
young children. To the pronounced word angry the child grasps 
his pencil, frowns, tenses his muscles and vigorously makes a 
wavy line. The empathic response is present but may be poorly 
expressed. However, his responses more and more resemble 
those of other children, and as we proceed up the age scale we 
gradually approach adult performance. In all probability a 
great many factors are operative to determine this growth,— 
certainly age is not the only factor. Accounting for the growth 
of the response on the basis of the acquisition of a community 
of ideas is probably not adequate unless it be assumed that 
children’s experiences are the same. 

V. Conclusions. A battery of four tests was constructed, 
designed to measure the empathic response in children and adults. 
Test I involved the drawing of lines in response to adjectives.
Test II was a dual choice situation in which lines and colors
were shown the subjects and they were required to encircle the
correct word. Test III called for a spontaneous response to
lines and colors, the response involving a kind of introspective
analysis of the subject’s reactions toward lines and colors. Test
IV employed 20 of the Ruckmick pictures of facial expression
and followed the Gates technique for the instructions. The
results obtained by giving the tests to kindergarten, grade school,
and junior high school children, college students, groups of
artistic and non-artistic subjects, lead us to the following
conclusions:


1. The empathic response begins at a very early age and may 
manifest itself in the reactions of kindergarten children. 

2. Growth is characteristic of this behavior and is gradual in 
nature indicating a certain constancy. 

3. Individual differences appear as early as kindergarten age 
and are manifest in all age-groups. They concern themselves 
for the most part with the ability to make the response. 

4. Unique responses, i.e., those differing radically from the
typical and in many instances probably involving a great use
of the imagination, an accompaniment of the response, are found
in all tests (except Test II) and at all age-levels.

5. Young children show a tendency to think in terms of 
pleasantness-unpleasantness, utilizing a dual type of response. 
Gradually they add other dimensions until a large range of 
responses is acquired. 

6. Bodily reactions accompany the empathic response, as 
revealed by facial expressions and bodily tensions. 

7. In general there seems to be some relationship between 
artistic ability and the four tests; exceptions are present in 
certain groups. 

8. There is a definite relationship between the various tests.
The tests are fairly reliable, but their reliability is not as high
as is desirable.

9. Grades in school, intelligence, grades in art, reactions to 
color harmony tests and sex do not seem to be related to the 
empathic response. 


10. The results are suggestive of further work and introduce
a possible diagnostic factor in the study of artistic and non-artistic
children.


THE VALIDITY AND RELIABILITY OF A MODIFIED
FORM OF THE McADORY ART TEST FOR
USE AT LOWER GRADE LEVELS¹


by


MILDRED DOW VOSS


Introduction. An investigation of the development of es- 
thetic appreciation carried on in the second, third, fourth, and 
fifth grades, required an art judgment test which could be applied 
satisfactorily to these grade levels. The McAdory Art Test²
is standardized for public-school ages and grades above Grade
III. Preliminary trial indicated that the subject-matter of this
test seemed suitable for lower grades. Accordingly an attempt
was made to determine if the standardized form or a modified
form of this test might be suitable for use in the second and
third grades.

Sampling. The subjects³ used in this study represented the
entire enrollment of the specified school grades of two Iowa
towns: Villisca, a town of about 2,000 population, and Grinnell,
a town of 5,000 population. The second-, third-, and fourth-
grade results are from Villisca, the fifth-grade results from
Grinnell. Both towns are primarily dependent upon the sur- 
rounding rural population for existence, having few manufac- 
turing industries. Grinnell represents a greater socio-economic 
range because of the presence of a small college. It has also a 
greater amount of public-school art education. 


1 This study was made concomitantly with the writer’s other investigation 
reported in this volume and is an indirect phase of the research program, 
Genetic Studies in Artistic Capacity. Since it is a by-product of another 
investigation, the data are more limited than they would have been had the 
primary object been the standardization of the test at these levels. 

2 McAdory, M. Art test. N. Y., Bur. of Publ., Teach. Coll., Columbia
Univ., 1929.

3 Acknowledgment is made to Superintendents H. E. Dow and C. E.
Humphrey and to the principals and teachers of the Villisca and Grinnell Public
Schools for their co-operation in this study.

Test program. In Grades II, III, and IV, four tests were
given: (1) an introductory test on the basis of which two
groups were formed, one of which received art training, the
other of which did not; (2) a follow-up test to both groups,
after which the second group received a different type of training;
(3) a second retest four weeks later; (4) a third retest
following the summer vacation, approximately a year later than
the original test. In Grade V, three tests were given: (1) an
introductory test as a basis for division of groups; (2) a retest
after a training period of three weeks for one group; (3) a
second retest after a training period of three weeks for the
second group.

Table I presents the number of children in each grade, the 
number of art tests administered, the mean CA (in months), and 
the mean intelligence and MA scores (in months) derived from 
the Kuhlmann-Anderson Group Intelligence Test (4th ed.). 


TABLE I. Description of grade groups 


               No. of      No. of art tests     Mean      Mean      Mean
Grade         children       administered        IQ        MA        CA
II               30              103            102.5      91.1      90.1
III              40              154            100.4     102.1     103.5
IV               33              123            100.7     114.9     114.5
V                85              252            102.5     130.0     128.5


Grade V as normative. Grade V represented the highest 
grade level in this investigation and is a year above the lowest 
grade on which the test is standardized. It was assumed, because 
of the McAdory standardization, that the test is reliable and 
valid at this level. The measures of reliability and validity 
obtained from the administration of the test at the fifth-grade 
level were used as a norm. It was proposed to test whether 
the standardized form of the test continued to give like measures 
of reliability and validity as it was applied to progressively lower 
levels, or, if not, what modifications could be adopted to insure 
a comparable validity and reliability. 

Administration. McAdory allows considerable variation in 
administration of the test. The following procedure was adopted 
in Grade V as a method which retained most of the advantages
of individual testing while also utilizing the time-saving advantage
of the group test.

The children were seated in rows in their customary classroom
manner. Each child was furnished with the standard record
blank and was given an equal number of the test plates, the
number being determined by the number of plates divided by
the number of children in the group to be tested. Care was
taken that the plates were in consecutive order, i.e., the first child
had Plates 1 to 7, the second child, Plates 8 to 14, etc., and that
this order was retained throughout the test period. The
following directions were given:


Find the record sheet on your desk like this (holding up record sheet). Put 
your name on the line after ‘Name.’ Put ........ (number of the grade) on
the line after ‘Grade,’ and ........ (name of school) on the line after ‘School.’
(The date was stamped in by the experimenter and the birth date was secured 
from school records.) 

I have given you some cards. Each card has four pictures on it. Do you see 
that each picture is different? In this test you are to look at each picture very 
carefully and decide which picture you like best, which one you like second best, 
and which one you like third best. 

Let me show you how to do it with this card (hold up Plate 71). See how
each picture is different (point out differences). At the top of this card is the
word ‘Plate’ and a number. Find this number on your record sheet (hold up
record sheet). It is in a little square and under it are four lines. The first
line begins with the word ‘Best.’ Suppose we say that we like B best (point
out on plate). We put the letter B on this first line under number 71. Everybody
do this. (Cover up B on Plate 71): Now which of these three pictures
do you like best? Let us say that D is best. Put a D on the line beginning
with the words ‘Next Best’ on your record sheet. (Cover up D): Now which
of these two pictures do you like better? Let us say that A is better. Put A
on the line beginning with ‘Third Best’ on your record sheet. Now only C
is left so that we put the letter C on the line beginning with ‘Worst’ on
the record sheet. You see the picture we liked best is on the line named
‘Best,’ the picture we like next best is on the line named ‘Next Best,’ the
picture we like third best is on the line named ‘Third Best,’ and the picture
that we did not choose is on the line named ‘Worst.’ Now let me see if your
paper is correct. (Check each child’s paper.)

Here is Plate 36. Let us say that we like D best, C next best, A third best, 
and B worst. (Write on blackboard.) Put this answer where it should go on 
your record sheet. (Check each child.) 

Now take the clip off your cards and lay it on your desk. Leave it there 
until I tell you to use it again. 

Each of you has different plates. Look at the top of each plate and find the
number; then find this number on your record sheet. Put your finger on this
number so that you will not lose it. Look at your pictures on the first plate.
See how each one is different; then choose the picture you like best. Put the
letter of this picture on the first line. Choose the picture you like next best.
Put the letter of this picture on the second line. Choose the picture you like
third best. Put the letter of this picture on the third line. Then put the letter
of the picture that is left on the last line. Then go on to the next picture.
When you have finished all of your pictures, turn them over on your desk so
that the white side is up.

(After each child has finished): Leave your record sheets lying on your 
desks and pick up your plates. Now all of you have finished your pictures. 
We will have to change so that each person has new ones. Listen carefully 
and do just as I tell you. Each person in this row, pass your pictures to the 
one back of you. The person at the back of the row brings his pictures to the
front. Now he gives his pictures to the first person in the second row. Now, 
everyone in the second row, pass your pictures to the one back of you. The last 
one in the second row brings his pictures to the first one in the third row. 
Everyone in the third row passes his pictures back. The last one brings his 
pictures to the first one in the fourth row. Now see if this row can do it. This 
row. Now I will take the pictures of the last one in the last row and bring 
them over to the first person in the first row. Now you have new pictures and 
can start again. 

(After each child has again finished): Let us see if you can change pictures
correctly. First row, change pictures. Second row. Third row. Fourth row.
Fifth row.


(After each child has again finished) : Change plates. 

(At conclusion of class period): We will have to stop today. Put your
record sheet on top of your pictures and clip them together with the clip on
your desk. Now wait until I have completely finished telling you what to do.
I want you to pass your papers forward up to the first of the row. Wait. As
you take the papers from behind, put your paper on top. You (to the last
child in row) pass your paper forward. Now you (second child from back of
row) put your paper on top and pass them both forward. You (third child)
put your paper on top of both. This is very important or the papers will be out
of order for us tomorrow. Have your paper on top before you pass the papers
on. Now, second row, pass your papers. Third row. Fourth. (Collect papers
being careful to keep them in order.) Tomorrow we will go on from where
we stopped today.


(Testing time was broken into three half-hour periods on successive days. 
Care was taken not to hurry the children.) 
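
The passing scheme described in the directions amounts to a rotation of consecutive packets of plates around the class. The sketch below is a hypothetical illustration, not part of the original procedure: the figures of 70 plates and 10 children are assumptions chosen only to reproduce the example of Plates 1 to 7 and 8 to 14, and the function names are invented for the purpose.

```python
def deal_packets(n_plates, n_children):
    """Split plates 1..n_plates into equal consecutive packets, one per child."""
    if n_plates % n_children != 0:
        raise ValueError("this sketch assumes the plates divide evenly")
    size = n_plates // n_children
    return [list(range(i * size + 1, (i + 1) * size + 1)) for i in range(n_children)]


def rotate(packets):
    """Each child passes the packet to the child behind; the last child's goes to the first."""
    return [packets[-1]] + packets[:-1]


def plates_seen(n_plates, n_children):
    """Plates judged by each child, in order, after every packet has gone round once."""
    packets = deal_packets(n_plates, n_children)
    seen = [[] for _ in range(n_children)]
    for _ in range(n_children):
        for child, packet in enumerate(packets):
            seen[child].extend(packet)
        packets = rotate(packets)
    return seen


seen = plates_seen(70, 10)
print(seen[0][:7], seen[1][:7])
```

With the packets kept in consecutive order, every child judges every plate exactly once, and only the starting point differs from child to child.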


Adequacy of the test at the fifth-year level. McAdory gives
norms for each age and grade based on scores made by New
York City children.⁴ The mean score of grade V-B girls was
103.3; the McAdory norm for grade V-B girls is 106. The
mean score of the fifth-grade boys was 93.9; the corresponding
McAdory norm for grade V-B boys is 102. To determine
if a discrepancy of age accounted for these lower scores, the
McAdory norm for grade V-B girls of a mean CA corresponding
to the mean age of the Grinnell girls, 10 yrs. 6 mos., was
determined as 114.0; and the McAdory norm for grade V-B
boys of a mean CA corresponding to the mean age of Grinnell
boys, 10 yrs. 9 mos., was determined as 105.0. The Grinnell
scores compare even more unfavorably with the norms when age
is considered. However, it must be remembered that the
environmental discrepancy is undoubtedly great.


4 Siceloff, M. M., Woodyard, E., et al. Validity and standardization of the
McAdory Art Test. N. Y., Bur. of Publ., Teach. Coll., Columbia Univ., 1930,
pp. 10-16.

The McAdory Art Test is divisible into two comparable forms:
Plates 1 to 36 comprising Form A, Plates 36 through 72 comprising
Form B. The reliability of the test for Grade V was
determined by correlating the score on Form A with the score
on Form B. The resulting coefficient of .71±.02 became, on
application of the Spearman-Brown prophecy formula, a reliability
for the whole test of .83. Carroll and Eurich report a
reliability coefficient, obtained by similar means, of .82 for 100
college students.⁵
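
The step from the half-test correlation to the whole-test reliability can be reproduced directly. The sketch below applies the standard Spearman-Brown prophecy formula, r_whole = k·r / (1 + (k - 1)·r), with k = 2 for a test twice the length of either form. The function name and the general lengthening factor `k` are illustrative additions.

```python
def spearman_brown(r_part, k=2.0):
    """Predicted reliability of a test k times as long as the part yielding r_part."""
    return k * r_part / (1.0 + (k - 1.0) * r_part)


# Form A against Form B in Grade V gave .71; the text reports .83 for the whole test.
print(round(spearman_brown(0.71), 2))
```

The result agrees with the reported .83, so the quoted figure follows from the half-test coefficient without further adjustment.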

As one group received no training between the first and second 
test, a second measure of reliability was determined by correlating 
each subject’s score on the initial test with his score on the retest 
three weeks later. For 41 children, r was .87±.03.

McAdory⁶ quotes correlations between initial and retests on
groups receiving instruction in the interim as measures of reliability.
The correlations were as follows: 101 art school students
retested after one week, .73±.04; 95 art school students
retested after two months, .63±.05; 54 women college students
retested after a week, .73±.05; 47 women college students retested
after a year, .59±.07. A similar coefficient was obtained for
the subjects in this investigation by correlating the first and
second test of the group receiving instruction. The resulting
coefficient of .64±.06 for 44 subjects compares favorably with
the reliabilities quoted above.

Modification of administration. From the foregoing, it would
seem safe to conclude that the McAdory Art Test in its standardized
form is as reliable at the fifth-grade level as in the higher
age-levels. The problem now was to determine if the standardized
form could be applied without loss of reliability to lower
grade levels or if modifications of administrative technique
were necessary. Two modifications seemed possible to increase
the reliability of the test should it fail to be applicable in the
standardized form: (1) administration as an individual rather
than as a group test; (2) diminution of the difficulty of separate
items by eliminating the more difficult intermediate choices and
requiring the child to choose only between the best and
worst items.


5 Carroll, H. A., and Eurich, A. C. Abstract intelligence and art appreciation.
J. Educ. Psychol., 23, 1932, p. 216.

6 Siceloff, M. M., Woodyard, E., et al., op. cit., p. 27.



To determine the effect of individual testing as opposed to 
group testing, a modification of procedure was followed in the 
administration of the test in the fourth grade. On the first 22 
plates of the test the child was tested individually, on the remain- 
ing 46 plates he was tested in a group as in Grade V. Following 
is a description of the modified procedure on the first 22 plates: 


(There was first a brief preliminary exercise from the Lewerenz Tests in 
Fundamental Abilities of Visual Art, Test 1, Recognition of Proportion, Items 
1 and 2.) I have some pictures I want to show you today. Here is one of 
them. There are four cups and each cup is different. You are to choose the 
cup that you think is the best looking, the one that you would like to have most. 
Which of these four would you choose? Now suppose I cover this cup (the 
one just chosen). That leaves these three. If you could have just one of 
these cups that are left, which would you choose? (Cover up one chosen) : 
Now which of these two cups that are left would you choose? 

Now do you see what we have done? We have picked the one you liked the 
very best (pointing), then the one you liked second best (pointing), the one 
you liked third best (pointing), and that left the one you liked least of all 
(pointing). We are going to do that with all of these pictures. It is a kind 
of a game. Haven’t you often picked out pictures you liked best in magazines 
or the kind of cars you liked best? That is what we are going to do here. 

Now let’s try another one. Which of the bowls do you like best? Second 
best? Third best? Look at each one carefully. 

(The McAdory plates were then presented with the following introduction 
accompanying each. The answers were recorded by the experimenter). 

Plate 1: Here are four sets of letters. If you look carefully you will see 
that each set of letters is printed differently. Which of the four sets is printed 
the best? (Cover up one chosen): Now which of these three sets of letters is
printed best? (As before): Now which of these two sets of letters is printed
the better?

Plate 2: Here are four tables. Suppose you were buying a new table for 
your living room at home. Which table would you choose as the very best 
looking table? But suppose you could not have that table, which of these three 
tables would you choose as the prettiest? Suppose you had only these two to 
choose from, which of these would you choose as the prettier table, the one you 
would like to have? 

Plate 3: Suppose your mother were to tell you that you might have a new 
sweater and the clerk showed you these four sweaters. Which one would you 
like to have? But suppose that this sweater had already been sold and you 
must choose from these three. Which one would you take then? But suppose
that your mother said that sweater was too expensive and that you must take
one of these two. Which one would you choose?

Plate 4: Now suppose we were going to put a fence around the school house 
yard. Look at these four fences and tell me which one of them would be the 
very best looking fence. But if we couldn’t get that fence, which one would 
you choose next? Suppose that the only two fences we could get were these 
two that are left. Which one of these would you choose as the better looking 
fence? 

Plate 5: Now suppose you were going to buy a new car. There were these 
four kinds that you could buy. Which would you choose first? Which would 
you choose second? Which third? Which fourth? 

Plate 6: Suppose you were going to buy a new dress (if a girl) or suppose 
your sister was going to get a new dress (if a boy). You had these four 
dresses to choose from. Which dress would you like to have most? Then 
suppose that your mother said you might have another dress. Which one of 
these three dresses would you choose then? Then suppose that your mother 
said that you needed just one more dress and no more. Which dress would 
you choose as the prettier dress of these two? 

Plate 7: Suppose your father is going to buy a new store building down
town. There are these four buildings that he likes. Which one of the four
do you like best? If he isn’t able to buy that one, which of the four would
you tell him to choose next? Then which of the two that are left would be
the better looking? So you will tell your father to buy this one first, if he
can, because it is the best looking; this one next because it is the next best
looking; this one third because it is the third best looking; and not to buy this
one at all because it isn’t at all good looking.

Plate 8: Now suppose that your mother is having a birthday soon, and you 
want to buy a very pretty plate for her birthday present. Which one of the 
four plates would you choose as the very prettiest? Which one as the second 
prettiest? Which one as the third prettiest? 

Plate 9: Suppose that your little baby brother is going to get a new suit.
Which suit will you choose as the best looking for him? Which suit as the
second best looking? As the third best looking?


Plate 10: These are pictures for the wall. You want to get a new picture 
to hang on the wall of your schoolroom. Which will you choose as the best 
looking picture? Which as the second best looking picture? Which as the 
third? 

Plate 11: Suppose that you are getting a pitcher for your mother. Which 
will you choose first? Which second? Which third? 

Plate 12: (Omitted.) 

Plate 13: Here are some more little boys’ suits. Which one will you choose 
first for your little brother? Second? Third? 

Plate 14: These are books. This part is the front of the book and this is 
the part of the book that bends when you open it. You see that they are colored 
differently. Which cover would you rather have on your books? Which is 
the next prettiest book cover? Which is the prettier of these two? 

Plate 15: Here are some more plates. These are much like those we saw 
before only this time they are colored. Which is the prettiest plate of them all? 
Which is the prettiest of these three? Which of these two? 

Plate 16: Suppose your mother is buying a new dress. Which one of these 
four dresses would you like her to have? Which of these three dresses? 
Which of these two dresses? 

Plate 17: This is a picture of a Dutch lady. You see each picture is colored
differently. In this picture her skirt is almost white, in this one it is darker,
and in this one it is very dark. You see that the walls are colored differently;
the hanging on the wall is different, and her cap is different in each picture.
Which of these four ways of coloring makes the best looking picture? Which
of these three ways of coloring makes the best looking picture? Which of
these two?


Plate 18: Here are four bowls, all alike except that they are colored differently. 
Which do you think is the very best looking bowl, the one you would 
like to have most? Which is the second best looking bowl? The third best 
looking bowl? 


Plate 19: Here are some more dresses. Which of these dresses would you 
like your mother to have? Which of these three dresses would be prettiest 
for her? Which of these two dresses? 


Plate 20: Here are some pictures of people. You see that they did not put 
in the faces so that we have to decide which is the best looking picture from the 
rest of the picture. Which one do you like best of all? Which one do you like 
best of these three? Which of these two? 


Plate 21: Here are some rugs. Which rug would you like to have best for 
your room at home? Suppose you had only these three rugs, which would 
you like best? Which of these two? 


Plate 22: Which of these four davenports would you like for your living 
room at home? Which of these three davenports? Which of these two? 

Plate 23: Do you see that the trees and bushes are planted around this 
house in different ways? Which way do you think they are planted around the 
house best? Which way in these three pictures? Which way are they planted 
around the house better in these two pictures? 


Reliability of the standardized test in Grade IV. The first 
problem was to determine if the standardized form of administration 
of the test was as reliable in Grade IV as in Grade V. 
The reliability coefficient between the score on Plates 24 to 47 
inclusive and the Plates 47 to 72 inclusive was computed for 
each child in Grade V and in Grade IV. The assumption was 
that the comparative reliability coefficients of these exercises 
would reflect the comparative reliability of the entire test administered 
in such a manner. For Grade V the reliability coefficient 
was .64±.02. Applying the Spearman-Brown prophecy formula 
for triple length, the reliability for the whole test was .84. For 
Grade IV the corresponding coefficient was .65±.04, which 
becomes .85 for the whole test. It would appear that the application 
of the test in its standardized form is as reliable in 
Grade IV as in Grade V. 
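
The step from the part-test coefficient to the whole-test figure can be checked with the general Spearman-Brown formula, r_n = n r / (1 + (n - 1) r), where n is the factor by which the test is lengthened. The following is a minimal sketch for illustration only; the function name is an assumption and does not come from the original study.

```python
def spearman_brown(r, n):
    """Predicted reliability of a test lengthened n times.

    r is the reliability of the part actually correlated;
    n is the ratio of whole-test length to part length.
    """
    return n * r / (1 + (n - 1) * r)


# Grade V: part-test r = .64, whole test treated as three times as long.
print(round(spearman_brown(0.64, 3), 2))  # 0.84
# Grade IV: part-test r = .65.
print(round(spearman_brown(0.65, 3), 2))  # 0.85
```

With n = 2 the same function gives the usual half-to-whole correction applied when Form A is correlated with Form B.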

Effect of individual administration of the test on reliability. 
To determine if individual administration of the test increased 
its reliability, the scores on the first 23 plates in Grade IV were 
correlated for the first and second tests, the second and third, 
and the third and fourth. The resulting correlation was .60±.05, 
which is raised to .82 for the whole test by the Spearman-Brown 
prophecy formula. To find the reliability of a similar series of 
exercises administered in the standardized manner, each child’s 
score was determined for Plates 37 to 60 inclusive, omitting 
Plate 48, of Form B of the test. This gave a duplicate series 
of approximately equivalent difficulty. The resulting correlation 
was .70±.04, which becomes .88 for the whole test. It 
would appear that administration as an individual test, at least 
in this grade, diminishes the reliability of the test rather than 
increases it. 

Effect of elimination of intermediate stage of difficulties on 
reliability. In Grades II and III the children were tested as in 
Grade IV, individually on the standardized form of the test on 
the first 22 plates. To determine if elimination of the inter- 
mediate degrees of difficulty on the plates improved the reliability 
of the test, the first 23 plates were scored for best and worst 
items only, i.e., if the best item was placed before the worst 
in the rating, the child received a score of 1, if the worst item 
was placed before the best, the child received a score of 0. Thus 
the possible range of scores was 0 to 68. These scores were 
also correlated for the first and second, the second and third, 
and the third and fourth tests for each grade. The results are 
presented in Table II. 
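
The two-item scoring rule just described can be sketched as follows. This is an illustration under stated assumptions: a child's response to a plate is represented as a list of item labels in the order chosen, and the labels "A" to "D", like the function names, are hypothetical.

```python
def two_item_score(ranking, best, worst):
    """Return 1 if the best item is placed before the worst item, else 0.

    ranking: item labels in the order the child placed them.
    best, worst: labels of the best and worst items on the plate.
    """
    return 1 if ranking.index(best) < ranking.index(worst) else 0


def total_two_item_score(responses):
    """Sum the two-item scores over plates.

    responses: iterable of (ranking, best, worst) tuples, one per plate.
    """
    return sum(two_item_score(r, b, w) for r, b, w in responses)


# Hypothetical responses to two plates whose best item is "A" and worst is "D".
plates = [
    (["B", "A", "D", "C"], "A", "D"),  # best placed before worst: scores 1
    (["D", "C", "A", "B"], "A", "D"),  # worst placed before best: scores 0
]
print(total_two_item_score(plates))  # 1
```

The intermediate items affect the four-item score but are ignored here, which is the sense in which the intermediate degrees of difficulty are eliminated.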


TABLE II. Comparison of the four-item form and the two-item form of 
the McAdory Art Test in Grades II, III, and IV⁷ 

                   Four-item form             Two-item form 
        No. of   r for       r for          r for       r for 
Grade   cases    22 items    whole test     22 items    whole test 
IV        89     .60±.05     .82            .61±.05     .82 
III      112     .44±.05     .70            .49±.05     .74 
II        73     .42±.07     .68            .47±.07     .73 


Either form of the test is of equal reliability in Grade IV, 
but in Grades III and II the two-item form of the test has 
greater reliability. 
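
The figure following each coefficient in Table II is presumably its probable error. The text does not state how it was computed; assuming the conventional formula P.E. = 0.6745 (1 - r²) / √N, the tabled values for the four-item form in Grades IV and III are reproduced to two decimals. The sketch below is only a check under that assumption, and the function name is hypothetical.

```python
import math


def probable_error_r(r, n):
    """Probable error of a correlation coefficient r from n cases.

    Uses the conventional formula 0.6745 * (1 - r^2) / sqrt(n);
    that this formula was used in the study is an assumption.
    """
    return 0.6745 * (1 - r * r) / math.sqrt(n)


# Grade IV, four-item form: r = .60, 89 cases.
print(round(probable_error_r(0.60, 89), 2))   # 0.05
# Grade III, four-item form: r = .44, 112 cases.
print(round(probable_error_r(0.44, 112), 2))  # 0.05
```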


7 These coefficients of reliability are lowered due to the fact that scores were 
correlated on tests between which a deliberate effort was made to change the 
child’s score. However, as the same influences operated on both forms, they 
may be used to determine the comparative reliability of the two forms. 

Effect of individual administration on the two-item form. In 
order to determine if individual administration of the test 
increased the reliability of the two-item form of the test in Grade 
IV, the two-item scores on the first 22 plates were compared 
with the two-item scores determined for each child on Plates 
37 to 60 inclusive, omitting 48, on Form B in Grade IV. The 
reliability coefficient on repeated tests on the two-item form 
administered individually (Plates 1-23) was .61±.05; on the 
duplicate form administered as a group test (Plates 37-60), 
.64±.05. This result is similar to the previous finding that 
individual administration of the test appears less reliable than 
group administration. 

Half-half reliability of four-item and two-item forms. Cor- 
relation of scores on repeated tests shows that individual admin- 
istration of the test fails to improve reliability but that omission 
of intermediate difficulties does improve reliability. To check 
further on the comparative reliability of these two forms, the 
coefficient of correlation between Form A and Form B of the 
test was computed for both the four-item and the two-item 
scores. The reliability for the four-item form for Grade IV is 
.68±.03 which is a reliability quotient of .85 for the whole test. 
Again the two-item form demonstrates a slight superiority in 
reliability. 

Administration of Plates 24 to 72 inclusive in Grades II and 
III. The test was administered as an individual test in the 
second and third grades. The first 22 plates were administered 
as in Grade IV. Plates 24 to 72 inclusive, however, were 
administered in the two-item form. Each plate was covered by 
a white cardboard with openings which permitted only the best 
and worst item of each plate to be seen.⁸ The child indicated 
simply the one he preferred of the two. Accordingly, no com- 
parison of the reliability of the individual and group form of 
administration of the test could be made in these grades. 

Reliability of modified forms of the test in lower grades compared 
with the reliability of the standard form in Grade V. On 
the basis of the foregoing results the test form giving maximum 
reliability would seem to be the group two-item form in Grade 
IV, and the two-item form in Grades II and III. As explained 
above, relative reliability of the group or individual form in these 
latter grades was not tested. The reliability for the whole test 
in each of these grades was determined by correlating the scores 
on Form A and Form B of the test and applying the Spearman-Brown 
prophecy formula. These reliabilities are compared with 
that obtained for Grade V in Table III. 

8 In this, as in all other uses of the term, ‘best’ refers to best as determined 
by McAdory. 


TABLE III. Relative reliability of the standardized form of the test for 
Grade V and the modified forms of the test for 
Grades II, III, IV 

        No. of                              Half-half      Reliability 
Grade   tests    Form                       reliability    whole test 
V        250     Group McAdory (4)          [illegible]    .83 
IV       123     Group McAdory (2)          .68±.03        .81 
III      153     Individual McAdory (2)     .68±.03        [illegible] 
II        96     Individual McAdory (2)     .75±.03        .86 


The reliability of the test used in these modified forms in the 
lower grades seems to be approximately equivalent to that 
obtained by using the standardized form of the test in Grade V, 
and apparently the test may be legitimately used as an extension 
to the lower levels as far as reliability is concerned. 

The reliability coefficients obtained from repetition of the 
test are shown in Table IV. Many of the reliability coefficients 
quoted by McAdory and other investigators are based upon the 
repeated test performance of groups having received instruction 
in the interim. The accompanying investigation offered an 
opportunity to test the effect of intervening training upon these 
coefficients. Accordingly reliability coefficients were computed 
for trained and untrained groups for each experiment. 

TABLE IV. Reliability coefficients for repetition of McAdory Art Test for 
groups with and without training 

               Weeks            r between Test I      r between Test I 
               intervening      and II of group       and II of group not 
Experiment     between tests    receiving training    receiving training 
I—McA (2)          20           .64±.06               .77±.04 
II—McA (2)          3           .87±.03               .79±.04 
  Retest           20           Group E  .81±.04 
                                Group C  .69±.06 
III—McA (4)         3           [illegible]±.05       .87±.03 
IV—McA (4)          3           .74±.05               .86±.03 

Training decreases the reliability coefficients on the test in all 
of the experimental situations except Experiment II. Either 
group of coefficients, however, is sufficiently high to compare 
favorably with the coefficients obtained by McAdory as 
previously cited. 

Reliability of the sub-tests of the McAdory Art Test. The 
McAdory Art Test may be subdivided into three sections testing 
ability to recognize comparative merit in form, in notan, and 
in color. As results were computed for each of these sections 
separately, it was desired to find some indication of the reliability 
of the score on each section of the test. Table V presents 
the results. 





TABLE V. Reliability of the sections on Form, Notan, and Color of the McAdory 
Art Test as measured by the correlation between 
Form A and Form B 

                                   Correlation between            Raised for 
                                   Form A and B                   Whole Test 
                         No. of 
Grade   Form used        tests     Form      Notan     Color      Form   Notan   Color 
V       McA (4) Group     250      .65±.03   .27±.03   .28±.04    .79    .64     .44 
IV      McA (2) Group     123      .77±.03   .32±.06   .31±.05    .87    .49     .47 
III     McA (2) Indiv.    153      .78±.02   .25±.05   .29±.06    .88    .37     .45 
II      McA (2) Indiv.     96      .74±.03   .28±.06   .12±.07    .85    .44     [missing] 








McAdory,⁹ using as subjects two art-school groups, R and S, 
101 and 95 in number respectively, and 59 persons in Hunter 
College, Group T, found the following reliability coefficients 
between the 12 plates of each section in Form A and the 12 
plates of each section in Form B: 

          Group R    Group S    Group T 
Form       .611       .577       .341 
Notan      .257       .238       .235 
Color      .421       .362       .378 


It is rather surprising that, as a whole, the correlations obtained 
in these lower grades are higher for the sub-tests than the correlations 
obtained by McAdory, although the reliability correlations 
for notan and color tend to be rather low. The total 
number of exercises involved, however, is not large so that high 
reliability would not be expected. 

9 Ibid., p. 28. 

The reliability coefficients obtained from repetitions of the 
sub-tests are shown in Table VI. Reliability coefficients were 
computed for both trained and untrained groups for each 
experiment. 

TABLE VI. Reliability coefficients for repetition of the sub-tests of the 
McAdory Art Test for groups with and without 
intervening training 

                 Weeks            r between Test I      r between Test I 
                 intervening      and II of group       and II of group not 
Experiment       between tests    receiving training    receiving training 
Form 
  I—McA (2)          20           .63±.06               .73±.05 
  II—McA (2)          3           .83±.04               .76±.04 
    Retest           20           Group E  .73±.06 
                                  Group C  .83±.03 
  III—McA (4)         3           .63±.06               .73±.05 
  IV—McA (4)          3           .77±.06               .79±.04 
Notan 
  I—McA (2)          20           .43±.05               .59±.07 
  II—McA (2)          3           .67±.06               .49±.08 
    Retest           20           Group E  .50±.09 
                                  Group C  .65±.06 
  III—McA (4)         3           [illegible]±.05       .59±.07 
  IV—McA (4)          3           .60±.07               .66±.06 
Color 
  I—McA (2)          20           .35±.09               .41±.09 
  II—McA (2)          3           .63±.07               .44±.08 
    Retest           20           Group E  .70±.05 
                                  Group C  .35±.09 
  III—McA (4)         3           .35±.09               .41±.09 
  IV—McA (4)          3           .47±.09               .52±.08 


It can be seen that training decreased the reliability coefficients 
on the sub-tests, Form and Color, except in Experiment II. The 
reversed situation is true on the sub-test, Notan, where training 
increases the reliability coefficient except in Experiment IV. 
The difference in this latter, however, is small. 

Conclusions regarding reliability. From the foregoing results 
we may conclude that the standardized form of the test is (1) as 
reliable in Grade V as in upper grades and ages; (2) as reliable 
in Grade IV as in Grade V. Individual administration decreases 
rather than increases the reliability of the test on both the four-item 
and the two-item form. On the first 22 plates the reliability 
obtained by correlating tests one and two of each test 
series on the four-item and on the two-item forms is approxi- 
mately equal. On the entire test, however, the reliability obtained 
by correlating Form A with Form B is slightly greater on the 
two-item form. In Grades II and III the two-item form of 
the test has greater reliability as determined by correlation 
of the scores of the first 22 plates on the first and second tests 
of each series. This two-item form has approximately the same 
reliability as the standard form in Grade V. 

Validity of the McAdory Art Test for lower grade levels. 
The validity of the application of a modified form of the 
McAdory Art Test to second-, third-, and fourth-grade levels 
is more difficult to establish. Teachers’ ratings or art grades 
seemed unlikely criteria as these would be based upon ability 
to draw rather than to appreciate art. In Grade V the children 
were allowed 15 min. to sketch with show-card colors. The 
painting scores obtained from the resulting productions were 
ranked in nine degrees of merit on the basis of good composition 
by ten competent judges. These rankings correlated with 
the art appreciation score on the McAdory Art Test, .34±.006. 

The following criteria were adopted as being indicative of 
the validity of the modified form of the test: (1) a compara- 
tively high correlation of the two-item form with the four-item 
form; (2) a comparatively high correlation of the McAdory 
Art Test scores with the total of the scores on three tests of 
esthetic judgment; (3) an increase in percentage of successes 
within successive grades; (4) similar correlations with intelligence, 
MA, and CA on the two-item and four-item forms of 
the test. 











Correlation of the two-item and four-item forms. To deter- 
mine how closely the two-item and four-item scores approxi- 
mated one another, the test was graded according to both 
methods, and correlations between these two sets of scores were 
computed for each grade. These are shown in Table VII. 

The correlation between the two forms of the test is high and 
approximately constant from grade to grade. 

TABLE VII. Correlation between the two-item form and the four-item 
form of the McAdory Art Test for each grade 

                                                       Probable 
        No. of   r for         No. of   r for          r for 
Grade   tests    whole test    tests    items 1-24     whole test 
V        248     .88±.09 
IV       124     .93±.08        126     .74±.03        .90 
III                             154     .71±.03        .88 
II                              103     .78±.03        .92 


Correlation with the total score of a battery of tests. In 
addition to the McAdory Art Test, two short additional tests 
were administered to the children in the second, third, and fourth 
grades. These included Part I, Test 1 on Recognition of Proportion 
of the Lewerenz Tests in Fundamental Abilities of 
Visual Art,¹⁰ and 12 paired items of the Whorley Unity 
Scales.¹¹ In adapting the Whorley scales for the purpose of this 
investigation, item 1 was paired with item 7, 2 with 6, 3 with 5 
for each form of the two scales, thus forming 12 pairs of 
test items. 

These tests were combined into a total score. As the length 
of each varied, the tests were given approximately equal weight 
by equalizing the variability according to a method described by 
Garrett.¹² As the respective variabilities were McAdory, 8.88, 
Whorley, 1.84, Lewerenz, 1.99, the measures on the latter two 
tests were multiplied by 5. The weighted scores of the three 
tests were then totaled for each child and the score on each test 
correlated with the total score. The resulting coefficients for 
365 measures were as follows: 

                                 McAdory Art Test   .79±.01 
Score on all three tests with    Lewerenz Test      .77±.01 
                                 Whorley Test       .73±.02 
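
The weighting described above can be sketched numerically. The variabilities and the multiplier of 5 are taken from the text; the dictionary and function names, and the example scores, are hypothetical and serve only to show how the weighted total would be formed.

```python
# Variabilities reported in the text for each test.
VARIABILITY = {"McAdory": 8.88, "Whorley": 1.84, "Lewerenz": 1.99}
# Multipliers reported in the text: the two short tests are multiplied by 5.
WEIGHT = {"McAdory": 1, "Whorley": 5, "Lewerenz": 5}


def weighted_variability():
    """Variability of each test after its multiplier is applied."""
    return {name: round(VARIABILITY[name] * WEIGHT[name], 2) for name in VARIABILITY}


def total_score(scores):
    """Weighted total of one child's three raw scores."""
    return sum(WEIGHT[name] * scores[name] for name in WEIGHT)


print(weighted_variability())
# {'McAdory': 8.88, 'Whorley': 9.2, 'Lewerenz': 9.95}

# Hypothetical raw scores for one child.
print(total_score({"McAdory": 40, "Whorley": 8, "Lewerenz": 6}))  # 110
```

After weighting, the three variabilities are of the same order, so no single test dominates the total merely because it is longer.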


As all three tests purported to measure the same general function, 
the fairly high agreement of the McAdory Art Test with 
the total score seems to indicate that the test is valid to a fairly 
satisfactory degree. 

10 Lewerenz, A. S. Tests in fundamental abilities of visual art. Los Angeles, 
Research Service, 1927. 

11 Whorley, K. An experimental investigation of the sensitivity of the 
pre-school child to compositional unity. Psychol. Monog., 45, 1933, No. 1, 
Plates I and II. 

12 Garrett, H. E. Statistics in psychology and education. N. Y., Longmans 
Green, 1925, p. 279. 

Decreasing difficulty of test items. A third test of validity 
was secured by the determination of relative difficulty of test 
items from grade to grade. In addition to the second, third, 
and fourth grade groups, previously described, the test scores 
of 35 junior and senior high school students, and 34 adults 
were included in this portion of the study for comparative 
purposes. The percentage of correct responses of the two-item 
form of the test was secured for each plate for each grade. 
These percentages were converted into P.E. distances above or 
below the mean on a normal curve. 
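
The conversion from a percentage of correct responses to a P.E. distance can be sketched as below. The sketch assumes that one probable error equals 0.6745 standard deviations of the normal curve and that an item passed by more than half the group is given a negative (easier) value, which agrees with the signs in Table VIII; the function name is hypothetical and the exact procedure used in the study is not stated.

```python
from statistics import NormalDist

PE_IN_SIGMA = 0.6745  # one probable error expressed in standard deviations


def pe_distance(percent_correct):
    """Difficulty of an item in P.E. units relative to the mean.

    Negative values mean more than 50 per cent passed the item.
    Undefined for 0 or 100 per cent, where the normal deviate is infinite.
    """
    p = percent_correct / 100.0
    z = NormalDist().inv_cdf(1.0 - p)  # normal deviate cutting off the passing proportion
    return z / PE_IN_SIGMA


print(round(pe_distance(50), 2))  # 0.0
print(round(pe_distance(75), 2))  # -1.0
print(round(pe_distance(25), 2))  # 1.0
```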

Instead of giving the P.E. value for each item for each grade, 
Table VIII presents a general survey of the number of test 
items passed by more than 50 per cent of the distribution for 
that grade, the number of items passed by less than 50 per cent, 
and the average P.E. value of the test items for that grade. 


TABLE VIII. A comparison of the difficulty of test items in successive 
grade groups 

                          No. of items           No. of items 
                          passed by more         passed by less         Av. 
                          than 50 per cent of    than 50 per cent of    P.E. value 
Grade                     the distribution       the distribution       of all items 
II                              34                     34                  .03 
III                             38                     30                 −.10 
IV                              40                     28                 −.21 
V                               57                     21                 [illegible] 
Junior and senior high 
  school                        67                      1                −2.42 
Adults                          68                      0                −3.15 


Table VIII shows that there is progressive decrease in the 
difficulty of the test items from grade to grade. Such a decrease 
would seem to indicate that the test is validly extended downward 
for use in the lower grades. 

Correlation of art score with intelligence and age factors. A 
fourth test of the validity of the modified form of the test 
was to determine if the correlations between the modified test 
scores and IQ, MA, and CA in each grade resembled the correlations 
obtained using the standardized form of the test in 
the fifth grade. The correlations for each grade are presented 
in Table IX. 


TABLE IX. Correlation of art score with other measures in each grade 

Grade   Test Form    IQ             MA             CA 
V       Four-item    .29±.06        .38±.06        .19±.06 
IV      Four-item    .60±.07        .56±.07        −.12±.10 
IV      Two-item     [illegible]    .26±.10        [illegible] 
III     Two-item     .20±.10        .24±.09        .08±.10 
II      Two-item     .29±.11        [illegible]    .18±.11 


The correlations obtained using the standardized form of the 
test in Grade IV indicate that intellectual factors are involved 
to such an extent that the test is no longer a valid measure of 
esthetic judgment. The two-item form presents more satisfactory 
relationships, which in Grades II and III seem sufficiently 
similar to those obtained in Grade V to warrant the 
conclusion that the modified form of the test does not involve 
other factors which destroy its validity. 

Conclusions regarding validity. The results of the examination 
of the validity of the test add further justification for 
the use of the two-item form in the fourth grade in order to 
eliminate the high correlation with the intelligence factor. The 
other results seem, in general, to justify the conclusion that 
extension of the McAdory Art Test to lower grade levels in a 
modified form has not impaired the validity of the test. 











THE MEASUREMENT OF QUALITY IN CHILDREN’S 
PAINTING BY THE SCALE METHOD¹ 


by 


CAROLYN TIEBOUT 


I. Introduction. While tests of artistic appreciation have 
been devised satisfying the scientific requirement involving con- 
trol of all variables under consideration, less objectivity has 
seemed possible in the measurement of artistic ability or the 
ability to produce. An approach to problems involving the 
artistic ability of children, however, made it incumbent upon 
certain of the investigators (8, 9, 11), whose studies are reported 
in this monograph, to develop, in so far as possible, a reasonably 
reliable means for determining status in artistic production. For 
this purpose the scale of values method was chosen as offering 
the most objective and most practical treatment available. While 
this method can not eliminate the necessity for a certain degree 
of subjectivity in judgment, it furnishes the judges with a more 
or less definite set of values on which to make decisions of 
merit. The superiority of this method over the direct judging 
technique has been demonstrated by Brooks (2). Using as 
instruments the drawing scales of Kline-Carey (5) and Thorn- 
dike (10), Brooks found the inaccuracy of rating to be reduced 
by one-half of what it was when ratings were made without 
the use of a scale. A scale of values offers a further advantage 
in that, once established, it may be used for other groups of 
similar material at different times and by different persons, while 

1 This study represents a co-operative program preliminary to the investiga- 
tions of Saunders, Meier and Stinson, and Tiebout. Invaluable contributions in 


constructing the scales were made by Dr. Norman C. Meier, the director of the 
study, and by the group of research associates and assistants in the Psychology 
of Art department. . 

The whole project forms a part of the Genetic Studies in Artistic Capacity 
sponsored by the Carnegie Foundation for the Advancement of Teaching and 
financed by the Carnegie Corporation. 
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ratings without a scale, no matter how expert the judges, are 
limited to the particular data used. 

Of the available published drawing scales, namely, the 
McCarty (7), the Thorndike (10), and the Kline-Carey (5 and 
6), the last is the most recent and advanced and is the only 
scale which includes a section on composition in which scoring 
is based on the achievement of esthetic qualities. The McCarty 
scale also includes compositions, but these are considered, not as 
esthetic representations, but as indications of ability to express 
ideas vividly and accurately, and are so judged. 

II. General plan for scales. While the composition feature 
of the Kline-Carey scale was fitted to the purpose of this study, 
in which the child’s ability to achieve artistic effects was to be 
measured, the fact that the ten samples representing the work 
of Grades I to VII were arranged in a single continuum did 
not make possible as fine a discrimination as was desired between 
the products of a single grade, particularly at the lower age-levels. 
It was therefore the plan of this study to devise a new scale, 
or rather a series of scales, in which the paintings for each of 
the grades from one through seven, were treated separately. 
The color element absent in the Kline-Carey samples was also 
added since it was deemed an important one in achieving artistic 
quality. Selecting the painting of compositions and devising a 
specific scale for this activity perforce limits the measure of 
artistic ability to this type of performance. It was the opinion 
of the investigators, however, that free illustration would offer, 
for a single performance, more opportunity for creative expres- 
sion and the realization of achievement of artistic possibilities 
than other activities such as the representation of a single object, 
the designing of a border, and similar exercises. 

Following the Kline-Carey plan, certain stories were selected 
as a basis for imaginative composition. This was done to secure 
sufficient uniformity in the paintings, to facilitate their judg- 
ment, and to insure the inclusion of compositional qualities. 
Story lists of from six to twelve titles were selected for each 
grade separately after consultation with the teacher and principal 
to select those most familiar to the majority of the children in 
the respective grades. Each child was permitted to select from 
his grade list the stories which he wished to illustrate or, if he 
preferred, he could illustrate incidents from a single story. The 
titles chosen were as follows: 


Grade I—The Little Red Hen, Old Mother Hubbard, Peter Rabbit, The 
Three Bears, Billy Goats Gruff, Little Miss Muffett. 

Grade II—The Little Red Hen, Old Mother Hubbard, Peter Rabbit, The 
Three Bears, Billy Goats Gruff, Little Red Riding Hood. 

Grade III—Jack and the Beanstalk, Old Mother Hubbard, Peter Rabbit, The 
Three Bears, Billy Goats Gruff, Hiawatha. 

Grade IV—Jack and the Beanstalk, King Arthur Stories, Old Mother Hub- 
bard, Peter Rabbit, Robin Hood, The Three Bears. 

Grade V—The Trojan War, The Pied Piper of Hamelin, King Arthur 
Stories, (Rip Van Winkle), William Tell, Robin Hood, Paul 
Revere’s Ride, Winter Sports, Hannibal Crossing the Alps, Old 
Mother Hubbard, (Hansel and Gretel), (The Golden Fleece). 
Also may be substituted Billy Goats Gruff, Little Red Riding 
Hood, Little Red Hen. 

Grade VI—The Trojan War, The Pied Piper of Hamelin, King Arthur 
Stories, Rip Van Winkle, Robinson Crusoe, Gulliver’s Travels, 
William Tell, Robin Hood, Paul Revere’s Ride, Winter Sports, 
Old Mother Hubbard, (Hansel and Gretel). Also Billy Goats 
Gruff, Little Red Riding Hood, Little Red Hen (one of these 
substituted for Hansel and Gretel). 

Grade VII—The Trojan War, The Pied Piper of Hamelin, King Arthur 
Stories, Rip Van Winkle, Robinson Crusoe, Gulliver’s Travels, 
William Tell, Robin Hood. 


III. Subjects. To secure a sufficiently large and adequate 
sampling of the art products of children in the grades chosen 
for study, it was decided to use a group of approximately 100 
subjects for each grade, normally distributed as to intelligence, 
with an average IQ approximating 100. With the assistance 
of the Division of Research and Pupil Adjustment, an elemen- 
tary school, judged as best meeting these requirements, was 
chosen from the public school system of Des Moines, Iowa. 
With the exception of Grade I, however, the number of subjects 
for whom complete data were obtained in Phillips, the school 
selected, was from 20 to 30 short of that desired and the IQ 
average was from two to four points below 100.² In Grades 
II through VI subjects were therefore added from another school, 
the general IQ level of which was estimated to be somewhat 
above average. Two supplementary schools in the same city 
were used: Hubbell, in Grades II to IV, and Perkins, in Grades 
V and VI. As in the case of the Phillips School, no selection 
of subjects was made in the two additional schools, all pupils 
in the single class used in each grade being included.³ In the 
case of Grade VII, which is included in the Junior High School 
system, the desired number of subjects and IQ average were 
secured in a single school, Woodrow Wilson, which serves the 
same area as the Phillips School.⁴ 

2 Classes for those retarded in school progress were not included. 

The following table which gives data concerning the entire 
group used will show that for the most part the conditions 
stipulated at the beginning of the program were met. 


TABLE I. Data on subjects 

                   Sex            CA                IQ 
Grade    No.     M     F       Av.      σ        Av.       σ 
I        100    67    33       6-8      5      100.38    10.53 
II       106    46    60       7-9      6       98.96     8.10 
III      103    51    52       8-9      7       98.78     9.99 
IV       100    49    51       9-10     7       99.82    11.28 
V        100    48    52      10-9      9       99.03    10.59 
VI       100    57    43      11-10     8       98.16    10.56 
VII      100    54    46      12-8      8       99.39     9.90 


As will be noted from the table there is an approximately 
equal number of boys and girls in each grade with the exception 
of the first where the sampling gave twice as many boys as 
girls. There is approximately one year’s difference in CA average 
from grade to grade. Intelligence tests were administered 
and paintings collected from all grades at approximately the 
same time so that this constant interval in chronological age 
and school development would be kept between each grade. 
Simultaneous collection of data in each grade also made it possible 
to eliminate the time variable which might otherwise have 
operated to affect results. 

3 This class in each instance was the last half of a division of the grade as 
distinguished from the first half, or the B division. 

4 Grateful acknowledgment is due to Miss Estelle Hayden, Director of Art of 
the Des Moines Public Schools, who made the program in these schools possible, 
and to Miss J. Ella Baker, principal of the Phillips School, for co-operation 
in the testing program conducted in this school from which the majority 
of the subjects were drawn. Acknowledgments are also made to the principals 
of other schools in which subjects were secured: Miss Laura P. Matthews of 
Hubbell, Miss Georgia Quigley of Perkins, and Mr. Everett Davis of Woodrow 
Wilson Junior High School, and to the art teachers and classroom teachers in 
these four schools who supervised the collection of the paintings. 

The intelligence test used was the Kuhlmann-Anderson Group 
Intelligence Test (4th ed.), which was applicable to the seven 
grades and thus gave a uniform testing instrument. Further 
data on the procedure and results of the intelligence testing are 
given in the following article (11). Since the IQ average for 
each grade approximates 100 the mental age averages would 
approximate those for chronological age given in the table. 

IV. Collection of paintings. Three paintings were made by 
each of the 709 subjects in the study, with the exception of the 
subjects in Grade I-B who made two paintings and two crayon 
drawings. The crayon drawings were included in view of the 
first graders’ lack of experience with the medium of paint. Three 
paintings instead of one were collected from each child since it 
was desired not only to secure a large sampling of work from 
each grade but to judge later the artistic status of the individual 
children whose products were used in the establishment of the 
scales. Complete instructions were given the class-room teachers 
in regard to control of the painting situation.⁵ Paintings were 
made independently, no child being permitted to see the work 
of another or to receive aid from the teacher. Only one paint- 
ing was made at a time and no time limit was set since it was 
felt that a more accurate indication of a child’s ability could 
be obtained when he had ample time to perform and where a 
sampling of his work was not limited to a single period, though 
not extended over such a period as to introduce a possible growth 
factor. Materials used in each grade were as follows: Tempera 
paints in red, blue, green, yellow, violet, orange, black, white, 
and brown; paper, 16” x 22” white book stock; and ¾ in.
brushes. Each painting was identified by means of a code 
number which indicated the grade and name of the child 
painting it. 

V. Construction of scale. The first step in the process of
scale-construction involved a preliminary sorting of the 2,227 


5 These technical aspects were supervised by Research Associate Saunders. 
paintings thus collected so as to facilitate the procedure for the
experts who were to make the final judgments for the scale.
Work on the scales covered a period of two years, the first
being devoted to the scales for Grades I through IV and the
second to those for Grades V-VII inclusive. While the same
procedure in developing the scales was followed in both instances, 
the personnel of the groups of judges necessarily varied. Pre- 
liminary judgments were made by four research assistants in 
the division of the Psychology of Art who, in addition to being 
trained in scientific methods, had had one to several years’ expe- 
rience with children’s art. The further qualification of art 
training was possessed by three of the judges, two in the first 
group working with Grades I-IV inclusive and one in the 
second group. 
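
The total of 2,227 products can be reconciled with the 709 subjects by a short calculation. The sketch below takes the grade sizes from the companion article that follows (100 per grade, with 106 in Grade II and 103 in Grade III) and assumes that every Grade I subject contributed four products (two paintings and two crayon drawings) while all others contributed three; that assumption is ours and is not stated outright in the text.

```python
# Grade sizes as reported in the companion article on intelligence.
grade_sizes = {"I": 100, "II": 106, "III": 103, "IV": 100,
               "V": 100, "VI": 100, "VII": 100}

# Assumption: Grade I children made 2 paintings + 2 crayon drawings,
# all other children made 3 paintings.
works_per_child = {grade: 3 for grade in grade_sizes}
works_per_child["I"] = 4

total_subjects = sum(grade_sizes.values())
total_works = sum(grade_sizes[g] * works_per_child[g] for g in grade_sizes)

print(total_subjects)  # 709
print(total_works)     # 2227
```

On this reading the figure of 2,227 includes the crayon drawings of the first graders.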

These judges, who will be designated as departmental judges, 
were instructed to arrange the paintings for each grade in five 
categories of merit from inferior to superior on the basis of 
esthetic quality with, among other attainments and qualities, 
due consideration of esthetically significant balance, rhythm, 
color harmony, etc. Six paintings for which there was complete 
or nearly complete agreement in the ratings of the four judges 
were then selected as samples of each of the five steps of merit: 
inferior, below average, average, above average, and superior. 
In other words, there were chosen for each grade thirty typical 
paintings, divided equally among the five degrees of artistic 
quality according to the agreement of the departmental judges. 
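
The selection of typical samples by agreement among the departmental judges might be sketched as follows. The text does not define "nearly complete agreement" numerically, so the spread limit and the majority rule used here are hypothetical, as are the function and parameter names.

```python
from collections import Counter


def select_samples(ratings, per_category=6, max_spread=1):
    """Pick sample paintings for each of the five categories of merit.

    ratings: dict mapping painting id -> list of category ratings (1..5),
             one rating per departmental judge.
    A painting qualifies when the judges' ratings differ by at most
    `max_spread` (hypothetical threshold) and a clear majority of the
    judges chose the same category.
    """
    candidates = {category: [] for category in range(1, 6)}
    for painting_id, votes in ratings.items():
        spread = max(votes) - min(votes)
        if spread > max_spread:
            continue
        category, count = Counter(votes).most_common(1)[0]
        if count * 2 <= len(votes):  # no clear majority, skip
            continue
        # Sort key: complete agreement first, then larger majorities.
        candidates[category].append((spread, -count, painting_id))
    return {
        category: [pid for _, _, pid in sorted(items)[:per_category]]
        for category, items in candidates.items()
    }
```

With four judges and the defaults, a painting rated 1, 1, 1, 2 would still qualify for the lowest category, whereas one rated 3, 3, 4, 4 would be passed over.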

These samples, representative of the range of artistic quality 
from definitely inferior to definitely superior in the products of 
each grade, were then submitted to a group of judges, desig- 
nated as experts, on the basis of whose ratings the scale itself 
was constructed. Fourteen judges rated the paintings for Grades 
I-IV inclusive and 17 judges, those for Grades V, VI and VII. 
Ratings by a relatively small and highly selected group of judges 
were considered a more reliable means for the establishment of 
artistic value than ratings by a larger group of less experienced 
judges. Those selected were qualified by their experience as 
art educators, with definite training in and contact with the field 
of children’s art. Care was taken not to restrict the selection 
to any one school of thought in art or to limit it geographically, so
that no constant influence would operate in favor of any 
particular bias or viewpoint in the consideration of artistic merit. 

These 14 judges for the earlier grades included three pro- 
fessors of art education, three directors of art in metropolitan 
public schools, four supervisors of art in a large city system, one 
assistant professor of art education, two instructors of chil- 
dren’s classes in art, one of whom is also a museum director, 
and one graduate student in vocational guidance and major in 
art in a mid-western university. Four of the group are also
listed in the Biographical Directory of American Artists (1). 

While the 17 judges for the later grades do not represent a 
group of as high a rank in the field of art education, they are 
definitely qualified as judges by their art backgrounds and their 
extensive experience with children’s art work. One of the num- 
ber is a director of art; five are art supervisors; seven, teachers 
of art; and four, graduate assistants with art training and 
teaching experience. 

These judges were instructed to arrange the 30 paintings in 
each grade in five ranks of artistic merit from inferior to 
superior. In judging the paintings the experts were advised 
to give due consideration, among other qualities, to the attain-
ment of rhythm, balance, unity, and other esthetic qualities,⁶
rather than to technique or realistic representation. The final 
decision of each judge, according to instructions, was to be based 
on his judgment of each composition as a whole, using those 
criteria of artistic merit with which he was most familiar and 
in which he placed most confidence. Since the judges were 
regarded as qualified experts in the field a more definite descrip- 
tion of the criteria for arranging the paintings was not deemed 
necessary or advisable. Certain mechanical aspects of the judg- 
ing process were clearly demonstrated so as to facilitate the 
procedure for the judge. Complete instructions were as follows: 


To insure uniformity of conditions for judging the children’s paintings, it 
would be much appreciated if the following suggestions were kept in mind. 

There are 30 paintings for the fifth grade in folder marked Grade V; 30 
also for each of Grades VI and VII. 


6 Referring to the attainment of these qualities in esthetically significant, 
rather than in obvious, stereotyped or dully formal ways. 
Take the paintings of one grade at a time and arrange enough space on a
large table or two desks for five groups. Then distribute the paintings into five 
groups in accordance with artistic merit from left to right as 


Group 1    Group 2                Group 3    Group 4                Group 5
Poor       Definitely inferior    Average    Definitely superior    Superior
           to average                        to average


It is suggested that the attainment of rhythm, balance, unity, or other esthetic 
qualities be given due consideration rather than technique or realistic repre- 
sentation. The final decision should represent the rater’s best judgment of each 
composition as a whole, using those criteria of artistic merit with which he is 
most familiar and in which he places most confidence. 

It might be found helpful to examine the entire 30 before beginning the 
sorting so as to secure some idea of their general nature. After sorting a 
greater degree of accuracy is usually obtained if each group is re-examined in 
turn. 

When each pile is sorted finally, then go through each and arrange the
paintings within the pile in order of merit. Then tabulate. The first number
(stamped on back) will be the best in the group, the second, the second best,
etc. Please record legibly and accurately.⁷

When all grades have been completed, shuffle all the pictures for Grade V
together so that no trace of your ranking remains; then do the same for
Grades VI and VII.


In order to obtain an average of the degree of merit ascribed 
to each painting by the experts, arbitrary values of 1 to 5 were 
assigned to the five ranks or groups, one representing the inferior 
and five the superior. For the scale it was desired to secure 
paintings most nearly representative of each quality step from 
1 to 5, hence those paintings were selected the average for which
equaled or closely approximated these exact numerical values.
In its final form the scale was amplified to nine steps by the
inclusion of paintings at the mid-point values, i.e., paintings for
which the average fell at or near 1.5, 2.5, 3.5 and 4.5. At cer- 
tain points on the scales more than one painting averaged the 
required value. These duplications of example were retained as 
further aid in judgment. The total number of paintings included 
in the scales was 11 in Grade IV, 13 in Grades I, III and V, 
14 in Grades II and VI and 16 in Grade VII. In five cases 
no painting attained the necessary average value; but since these 
fell at the extremes of the scales, four at the upper and one at 
the lower, extrapolation was not difficult.⁸
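
The averaging and selection just described might be sketched as follows. The text says only that the averages "equaled or closely approximated" the target values, so the tolerance used here is a hypothetical figure, and the function and variable names are illustrative.

```python
from statistics import mean

# Nine scale steps: the five whole values and the four mid-points.
SCALE_STEPS = [1.0 + 0.5 * k for k in range(9)]  # 1.0, 1.5, ..., 5.0


def build_scale(expert_ratings, tolerance=0.1):
    """Assign paintings to scale steps by their average expert rating.

    expert_ratings: dict mapping painting id -> list of values 1..5,
                    one per expert judge.
    tolerance: hypothetical limit on how far an average may lie from
               a step value and still represent that step.
    Returns a dict mapping each step value -> sorted list of painting ids;
    a step may hold several paintings or none.
    """
    averages = {pid: mean(values) for pid, values in expert_ratings.items()}
    return {
        step: sorted(pid for pid, avg in averages.items()
                     if abs(avg - step) <= tolerance)
        for step in SCALE_STEPS
    }
```

A step left empty by this procedure corresponds to the cases mentioned above in which no painting attained the necessary average value.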


7 Data secured from the order of merit ranking of the judges were not used 
since practical considerations made it necessary to abandon the use of the more 
elaborate statistical treatment of the data by the probable error method 
employed by Thorndike and others. 


8 Failure to secure samples for the scale at the highest value resulted from a 
The reliability of the judgments of the experts in rating the
120 sample paintings for Grades I-IV inclusive is indicated by
a correlation of .94±.01 obtained between the average ratings of
the first half of the group of judges and the average ratings
of the second half, while for the 90 sample paintings in Grades
V-VII inclusive the correlation secured in the same manner was
.93±.01. Using the Spearman-Brown prophecy formula (3,
p. 269) to determine the reliability of ratings for the entire 
group of judges in both instances, correlations of .97 and .96, 
respectively, were obtained. The average of the ratings of the 
120 sample paintings by the departmental judges correlated with 
the average of the expert judges’ ratings .93±.01 and for the
90 samples, .90±.01. These correlations are interpreted as
indicating a high reliability for the method employed in the scale 
construction which was based entirely on the averages of indi- 
vidual judgments. In addition, they lend confirmatory evidence 
as to qualifications of the departmental judges, since these judges 
agreed almost as closely with the average rating of the experts 
as these latter agreed among themselves. The correlations
between departmental and expert judges, however, are doubtless 
higher than would result from a correlation of ratings of the 
entire group of over 2,000 paintings, since only those paintings 
were selected as samples for which the departmental judges 
agreed exactly or approximately. 
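
The split-half procedure and the Spearman-Brown step-up reported above can be illustrated with a short sketch. The formula used, r_full = 2r / (1 + r) for a doubling of the number of judges, is the standard form of the prophecy formula; the function names and the layout of the ratings are illustrative assumptions, and no study data are reproduced.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r, factor=2.0):
    """Reliability predicted when the number of judges is multiplied by `factor`."""
    return factor * r / (1.0 + (factor - 1.0) * r)


def split_half_reliability(ratings_by_judge):
    """ratings_by_judge: one list of ratings per judge, paintings in the same order.

    Correlates the average ratings of the first half of the judges with
    those of the second half, then steps the result up to the full group.
    """
    half = len(ratings_by_judge) // 2
    first, second = ratings_by_judge[:half], ratings_by_judge[half:]
    mean_first = [sum(col) / len(col) for col in zip(*first)]
    mean_second = [sum(col) / len(col) for col in zip(*second)]
    r_half = pearson_r(mean_first, mean_second)
    return r_half, spearman_brown(r_half)


# The half-group correlations reported in the text step up as stated.
print(round(spearman_brown(0.94), 2))  # 0.97
print(round(spearman_brown(0.93), 2))  # 0.96
```

The step-up applies because each half-group average rests on only half of the judges, so the correlation between halves understates the reliability of the average of the whole group.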

VI. Use of scales. Since we desired to know the art status 
of the subjects whose work was used in devising the scales, their 
paintings were again judged by the four departmental judges 
who for this rating employed the scales which had been estab- 
lished. The average of the values assigned by the four judges 
was taken as the score for each painting and for the final score 
of each individual an average was taken of the values thus 
assigned to his three paintings or, in the case of Grade I, to 
his two paintings and two crayon drawings. For the final judg- 
ments of the paintings using the scales, correlation between the 


tendency on the part of the expert judges, particularly in the grades above the 
second, to place a larger number of paintings in the lower than in the upper 
categories of merit. This was due probably to a tendency to judge the paintings
according to standards achieved by children who had more specific training in
the handling of tempera paint.
ratings of two judges against two judges yielded a coefficient
of .79±.01 in Grades I through IV and of .73±.01 in Grades
V-VII inclusive. These coefficients of correlation were raised
to .88 and .84, respectively, when the reliability for the four
judges was determined by applying the Spearman-Brown
prophecy formula.
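
The two-stage averaging that yields each subject's final score might be sketched as follows. The function names and the sample values are hypothetical; only the procedure (average of the judges for each product, then average over the subject's products) is taken from the text.

```python
def painting_score(judge_values):
    """Score of one painting: the average of the scale values assigned by the judges."""
    return sum(judge_values) / len(judge_values)


def subject_score(works):
    """Final score of one subject: the average of the scores of his products.

    works: list of lists, one inner list of judge values per painting
           (three paintings, or two paintings and two crayon drawings
           in the case of Grade I).
    """
    scores = [painting_score(values) for values in works]
    return sum(scores) / len(scores)


# Hypothetical subject with three paintings rated by four judges.
example = [[2, 3, 3, 2], [3, 3, 3, 3], [4, 3, 4, 3]]
print(subject_score(example))  # 3.0
```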


Data on the actual results of the applications of the scales are 
included in reports of the studies in which various units of the 
original group of subjects were used. For their investigations 
Saunders (9) and Tiebout and Meier (11) included the entire
group of 100 subjects in each grade, the former using Grades I-IV
inclusive and the latter the complete grade range from I-VII
inclusive. In the Meier and Stinson study (8) certain selected 
subjects from Grades III, V, and VII were used. With the 
resulting consistency in the ratings of artistic ability and the 
testing of the same group of subjects for other related factors, 
more complete and comparable results were possible. The use 
of the scales for subjects other than those of the original group, 
as in the study of Saunders, afforded a further advantage in the 
attainment of a uniform definition of status in artistic production
throughout the program.
ARTISTIC ABILITY AND GENERAL INTELLIGENCE¹


by 


CAROLYN TIEBOUT AND NORMAN C. MEIER²


I. Introduction. In analyses of the relationship between 
various abilities and general intelligence, certain ones have 
appeared to be but slightly, if at all, related to ability in general,
and to these the designation of special abilities or talents has been 
applied. Theoretically, artistic ability has been denoted as one 
of these, since “artistic aptitudes appear to be controlled primarily
by special gifts rather than by general ability” (29, p. 505).³
Actual evidence in support of this contention exists, however, 
only for that limited phase of artistic performance which has 
been designated as representative drawing. Furthermore, sta- 
tistical data supplying such evidence of a low degree of relation- 
ship between general intelligence and ability in representative 
drawing have been derived, in the main, from investigations 
dealing with unselected groups, such as those of Ayer (2), 
Elderton (8), Fischlovitz (9), and Ivanoff (16). These studies 
have uniformly shown low, positive correlations between 
measures of the two variables.⁴

Complete agreement with these findings does not appear 
in the results of studies dealing with subjects selected for 
superior ability in art as manifested in other activities in addition 
to representation of single objects. The investigations of 
Kerschensteiner (17), Kik (18) and Terman (38) indicate that 

1 This study is part of the research program, Genetic Studies in Artistic
Capacity, sponsored by the Carnegie Foundation for the Advancement of
Teaching with financial aid from the Carnegie Corporation.

2 The first section of this study (grade school subjects) was developed by
Dr. Tiebout; sections two and three (special subjects and artists) chiefly by
Dr. Meier.

3 Quoted from Murphy’s discussion of Spearman’s theory of intelligence.

4 Summaries of the findings of these studies have been made by
Goodenough (12) and by Hollingworth (14).
general intelligence is functioning in artistic performance and,
to such an extent, that those showing marked accomplishment in 
art are invariably of high intelligence. A less extreme view in 
regard to the importance of general intelligence for successful 
performance in drawing, and one more compatible with the find-
ings reported for unselected groups, is expressed by Manuel. He
states that “general intelligence conditions the ability of drawers
to acquire the advanced technique into which conceptual factors 
enter and to create original drawings of merit” (26, p. 119). 
It would seem logical that a certain degree of intelligence is 
necessary for marked achievement in any endeavor involving 
mental activity, but limiting the possibility of such achievement 
in art to individuals at the superior intellectual level does not 
appear tenable, if findings showing the character of special ability 
of representative drawing hold also for artistic performance in 
general. It is apparent from this summary of findings of the 
two types of investigations that studies using more comparable 
measures of artistic ability in unselected and in talented groups, 
with adequate samplings of each, must be made before any 
degree of consistency in results can be expected. For unselected 
groups no study has yet been made of the relationship between 
intelligence and the ability involved in creation of compositions 
judged for artistic merit, and similar studies with subjects at the
high school and adult level who have demonstrated outstanding 
creative ability in art have been limited both in number and in 
scope. 

The present investigation was therefore designed first, to 
determine the relationship existing between general intelligence, 
as measured by established intelligence tests, and artistic ability, 
as measured by the achievement of æsthetic quality in compo-
sition for a sampling of the child population in Grades I-VII
inclusive; and second, to determine the status of general intelli-
gence of two selected groups, one of artistically superior subjects
at the high school level and the other of adult artists of recog- 
nized achievement in their profession. 

II. Selection of subjects. A. Normal group. Subjects for 
the first phase of this study were 100 children in each of the 
grades from I-VII inclusive (106 in Grade II and 103 in
Grade III), with an average IQ of 100 or slightly less, average 
chronological ages separated by about one year from grade to 
grade, and an approximately equal number of boys and girls, 
except in Grade I where the sampling gave twice as many boys 
as girls. 

These subjects were drawn from the public schools of Des 
Moines, Iowa, a mid-western city of slightly over 140,000 popu-
lation.⁵ The majority of the subjects in Grades I-VI inclusive
were enrolled in the Phillips School, but it was found necessary 
to supplement this group with pupils from the Hubbell School in 
Grades II-IV inclusive and the Perkins School in Grades V and 
VI. All of the Grade VII subjects were students in the 
Woodrow Wilson Junior High School, which serves the same 
area as the Phillips School. Complete information on the selec- 
tion of these subjects is given in the preceding article (39) which 
is introductory to this and the studies of Saunders (33) and 
Meier and Stinson (28) in which these same subjects were used. 

B. Artistically superior high school group. The artistically 
superior subjects used in the second phase of the investigation 
were selected by a canvass of the junior and senior high school 
population (Grades VII-XII inclusive) in Milwaukee, Wis-
consin.⁶ This locality was chosen because of the well organized
department of art and the unusual facilities offered in securing
subjects whose work had been carefully followed over a period 
of years. Also, the size of the population made it possible to 
exercise high selectivity and yet secure a sufficiently large sample 
for study. Attention had been called to the superior art work of 
a number of Milwaukee school children through reference to it 
in The Art of the Child (32). Certain of these children were 

5 Acknowledgment is made to Miss Estelle Hayden, Director of Art in the
Des Moines Public Schools, and to the principals and teachers co-operating in
the study, particularly to Miss J. Ella Baker, principal of the Phillips School,
where the greater part of the program was conducted.

6 Acknowledgment is made to Mr. A. G. Pelikan, Director of Art in the
Milwaukee Public Schools and to the art supervisors, whose co-operation made
the program with the artistically superior subjects possible. Acknowledgment
is also due to the superintendent, Mr. Milton C. Potter, for permission to test
in the Milwaukee schools, and to the principals and art teachers at the various
schools where the study was conducted.
identified as artistically superior and with this group as a nucleus,
a list of subjects showing marked ability in art activities was 
made by the art supervisors. The subjects were divided into two 
groups, those designated as outstanding or unusual and those 
regarded as less outstanding but superior. Of the former group, 
24 and, of the latter, 22 were included in this study. This 
second group was supplemented by 26 subjects rated by their 
respective art teachers as very good. Further study and rating 
of the art products of this group added by the teachers may later 
justify the inclusion of certain of these subjects in the first group, 
so that too rigid a classification was inadvisable. 

These subjects represent a selection from nine high schools, 
two junior high schools and the 7th and 8th grades of six ele- 
mentary schools. It had also been planned to study a group of 
artistically superior subjects selected in the elementary school 
grades in order that direct comparison of the findings could be 
made with results on the normal groups in these grades used in 
the first division of this study. However, since so few subjects 
had been chosen from any one of the 20 elementary schools, 
which were located over a large municipal area, time did not 
permit the testing of this group. 

C. Adult artist group. For the study of the relationship 
between general intelligence and artistic aptitude in a highly 
selected group at the adult level, the co-operation of 200 con-
temporary American artists⁷ was solicited through correspond-
ence. The selection of these 200 artists was made by Dr. Meier 
on the basis of their outstanding achievement in art, as attested 
by the number and significance of honors won, and their general 
standing.⁸ They were chosen from the approximately 5,500
names listed in the Biography of American Artists (1). 

Of the total number solicited, 51 of the artists co-operated by 
taking the intelligence test submitted them with the request letter. 

7 This number does not include eight artists, letters to whom were returned, 
four because of incorrect address and four because the artist was deceased. 
Dr. Meier conducted the correspondence, including the original request letter 
to the artists and follow-up letters sent in certain instances. 

8 Because of his wide knowledge of artists and of the awards made in the
art world, Dr. Meier was highly qualified to act as a judge of artistic achieve-
ment in making the selections for this study.
Of the remaining 149, replies were received from 31 artists
expressing inability to do so for various reasons, chief among
which were lack of time, lack of interest or disapproval of
assumed program, and illness. Similar reasons doubtless apply 
in the case of the majority of the 108 artists who sent no replies. 

While it did not include all of the highest ranking artists 
among the number solicited,⁹ this group of 51 artists was felt
to be a fairly representative sampling of the large group of 200, 
not only in respect to rank in art but in respect to other important 
factors. The group was not limited to any one so-called school 
of art but represented a wide range of viewpoints. It also 
included artists of varying age, ranging from 28 to 82, with an 
average of 54 years, and in addition to painters, some of whom 
are also etchers, writers, or illustrators, there were among the 
number four sculptors and three caricaturists. Thus a wide 
range of interests within the larger field of art was represented. 

III. Measurement of intelligence. For the grade and high
school subjects the Kuhlmann-Anderson Intelligence Tests were 
used for the measurement of general intelligence. This choice 
was made because the Kuhlmann-Anderson series includes bat- 
teries of tests for the entire range of subjects which were to be 
studied, hence a uniform measure could be used throughout the 
grades. Also, the Kuhlmann-Anderson’s use of the median 
mental age method of scoring in which separate mental age scores 
are assigned on each test in the battery “shows definitely and
accurately on what particular tasks a child scores above or below
his median mental age” (23, p. 4). Such information, it was
thought, could be used in analyzing the relation of the general 
intelligence test results to the measures of artistic ability by 
utilizing the scores on the sub-tests. 
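
The median mental age method lends itself to a simple illustration. The sketch below assumes only what the text states, namely that a separate mental age is assigned on each test in the battery and that the median of these serves as the subject's mental age; the test labels and values are hypothetical and do not come from the Kuhlmann-Anderson manual.

```python
from statistics import median


def median_mental_age(subtest_mental_ages):
    """Summarize a battery scored by the median mental age method.

    subtest_mental_ages: dict mapping test label -> mental age in months.
    Returns (median mental age, tests scored above it, tests scored below it).
    """
    med = median(subtest_mental_ages.values())
    above = sorted(t for t, ma in subtest_mental_ages.items() if ma > med)
    below = sorted(t for t, ma in subtest_mental_ages.items() if ma < med)
    return med, above, below


# Hypothetical battery of five tests for one child.
battery = {"test 1": 96, "test 2": 104, "test 3": 100, "test 4": 112, "test 5": 90}
print(median_mental_age(battery))
```

The lists of tests falling above and below the median are the kind of sub-test information the authors hoped to relate to the measures of artistic ability.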

The Kuhlmann-Anderson series was used in the study of the 
large samplings in Grades I-VII inclusive and in the study of the
artistically superior subjects. In both instances it was adminis-
tered as a group test in the classroom situation by Dr. Tiebout.¹⁰


9 This point is discussed further in the description of rankings of these artists. 

10 In one of the Milwaukee schools, Peckham Junior High School, the
Kuhlmann-Anderson test had already been administered hence the test was not 
repeated and IQs from the school record were used. 
For the former group the Goodenough Drawing-of-a-Man Test
was also given to a smaller number of subjects in Grades I-IV 
inclusive, for the purpose of making comparisons between the 
relationships between artistic status and the Kuhlmann-Anderson
and Goodenough intelligence quotients. The Goodenough 
Drawing-of-a-Man Test was not given to Grades V—VII inclu- 
sive, nor to the artistically superior group since the test is more 
applicable to the lower age groups and norms are not given 
beyond age 13, which is below that of the majority of the superior 
group. For a number of the artistically superior subjects indi-
vidual test results were secured on the Stanford-Binet Intelligence
Test for the purpose of analyzing responses on the separate tests. 
Where two tests were administered, they were given on approxi- 
mately the same date so that direct comparisons between IQs 
could be made, and consideration of possible changes in IQ over 
a period of time would not be necessary. 

Adult artist group. In the case of the adult artists the Otis 
Self-Administering Tests of Mental Ability, Higher Examina- 
tion, Form A, were selected for the measurement of intelligence. 
This series of tests was chosen because, as the name implies, it 
is ‘ self-administering,’ the function of an examiner being merely 
to see that the subjects understand the printed directions given 
on the test form and to give the ‘go’ and ‘stop’ signals. Since 
the investigation with the artists was conducted entirely through 
correspondence it was impossible to have an examiner present to 
control the time limit and the conditions under which the test 
responses were made. This had to be left to the individual 
artist. Definite information was given in the correspondence in 
regard to control of both these factors and the artist’s attention 
was particularly called to the absolute necessity for rigid adher- 
ence to instructions given on the test blank, the time limit being 
heavily underlined in red for especial emphasis. 

Not only was it felt that this test could be completely self- 
administered with considerable success by adults, but it was 
thought that the short 30 min. time limit would be more con- 
ducive to co-operation on the part of the artists than one making 
greater demands on their time. Furthermore, this test repre-
sents a most significant prognostic and vital short collection of
items, doing virtually the work of longer tests. 

IV. Measurement of artistic ability. For the measurement of 
the second factor, artistic ability, no standardized tests were avail- 
able as in the case of general intelligence. The term artistic 
ability as here used refers to ability to produce or, as defined by 
Kinter (19, p. 8), “the ability to create forms of artistic merit,”
in contradistinction to artistic appreciation. With the large
number of subjects, certain limitations were of necessity placed
upon the extent to which art production could be sampled. Since
the aim was to measure a type of ability other than that dealing
exclusively with accuracy in representation, which had been
studied by previous investigators, it was decided to have each
subject make three paintings, or compositions, illustrating stories,
to be judged on the basis of the æsthetic quality achieved. Since
the only available scale for such measurement, the Kline-Carey 
Measuring Scale for Freehand Drawing, Part II (21), did not 
include sufficient samples to give the desired differentiation in 
products, particularly at the lower grade levels, a new measuring 
series was devised consisting of separate scales for each of the 
seven grades. The development and use of these scales in deter- 
mining the artistic status, or painting score, of the subjects used 
is described in the preceding article (39). In brief, the final 
painting score of each subject represents the average value of his 
three paintings as judged by the scale devised for his grade. 

Ranking of adult artists. The procedure in determining the 
artistic standing of the selected high school subjects and of the 
adult artists has already been discussed. In order to interpret 
results in the case of the adult artists, a further step was taken 
to determine more exactly their ranking in their profession. The 
names¹¹ of 49 of the artists co-operating in the study by taking
the Otis Test were submitted to a number of judges, who were 
requested to rank the artists in the order of their importance 


11 A total of 57 names appeared on the list, since eight artists who might 
respond to a second request were added. Two of the 51 artists returned their 
test responses after the list had been sent to the judges. 
from the foremost artist in the list to the least important.¹² In
determining the relative importance of these artists the judges 
were instructed to consider the significant honors won and the 
probable permanent value of the work of each. The judges 
were also advised to emphasize the artist’s standing as a 
“creative artist rather than as a merely skillful technician or a 
clever painter of a given type of subject-matter such as portrait 
or landscape” (quoting from correspondence with the judges by
Dr. Meier).

Judges were chosen who were qualified to make such rankings 
because of their knowledge of the more important contemporary 
American artists in their capacities either as art museum direc- 
tors, directors of art education, or art editors. Of the four 
judges who made rankings of the artists, one submitted a list 
omitting only three of the 49, while a second omitted 18. The 
rankings of the other two judges were too incomplete to serve 
the purposes of this study. While more or less complete rank- 
ings were thus obtained from only two judges, their unusual 
qualifications would warrant the attachment of a great deal of 
weight to their choices. The correlation between the rankings 
of the two judges was .56 as computed by the method of rank-
differences (10, p. 190). This is equivalent to a product-
moment r of .58±.08. Both rankings were used for correlation
with other data in the treatment of results. 
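
The rank-difference coefficient and its product-moment equivalent can be illustrated as follows. The text does not give the conversion used; the sketch assumes Pearson's relation r = 2 sin(πρ/6), which reproduces the reported figures, and it assumes complete, untied ranks for the artists ranked by both judges. The function names are illustrative.

```python
from math import pi, sin


def rank_difference_rho(ranks_a, ranks_b):
    """Rank-difference correlation: rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)).

    ranks_a, ranks_b: ranks given by two judges to the same n artists,
    in the same order, with no ties.
    """
    n = len(ranks_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1.0 - 6.0 * sum_d2 / (n * (n * n - 1))


def rho_to_r(rho):
    """Product-moment equivalent of rho, assuming Pearson's conversion."""
    return 2.0 * sin(pi * rho / 6.0)


# The reported rho of .56 converts to the reported r of .58.
print(round(rho_to_r(0.56), 2))  # 0.58
```

Since neither judge ranked all 49 artists, the coefficient presumably rests on those artists ranked by both, re-ranked within that common set.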

In addition to ranking the artists whose names appeared on 
the list, both of the judges submitted supplementary lists of 
important artists who had not been included. One listed 17 
artists whose respective ranks would fall between 4 and 27 in 
comparison with those on the original list. Another named 13 
artists whom he felt should be included among the most impor- 
tant in the country. It is therefore apparent that results were 
not secured from all of the highest ranking artists, as has already 
been pointed out, but the fact that these supplementary lists are 
not more extensive would suggest that the upper extreme is more 
or less adequately represented. 


12 The selection of the judges and all correspondence with the judges were 
taken care of by Dr. Meier. 
V. Data on Subjects. A. Normal group. Results obtained
through the use of these measures of intelligence and artistic 
ability described in the preceding section, together with other 
necessary data, were compiled in tabular form for all groups of 
subjects used in this study. Table I presents summarized data 
for the entire group of subjects in Grades I-VII including, in 
addition to the material reproduced from Table I of the preceding 
article (39), a column giving the average of the painting scores 
as the measures of artistic ability have been designated. 


TABLE I. Summarized data on normal group 





Since the results of the subjects in Grades II-VI inclusive of 
the Phillips School have also been treated apart from those for 
the entire group, data for this group alone are given in the fol- 
lowing table. Grades I and VII are not included since these 
groups, as has been pointed out, were composed entirely of 
students in the Phillips and in the Woodrow Wilson Schools, 
respectively. 


TABLE II. Summarized data on normal group, Phillips School 


                   Sex               CA          IQ (K-A)        Ptg.S
Grade  No.      M         F        Av.     σ     Av.      σ      Av.       σ
II     78    [illeg.]  [illeg.]    7-8     6    97.68    7.80    2.63     .80
III    75      38        37        8-7     6    96.57    9.12    2.45     .46
IV     70    [illeg.]  [illeg.]    9-10    5    98.31   11.49    2.56     .54
V      70    [illeg.]  [illeg.]   10-10    8    97.02   11.16  [illeg.]   .51
VI     73      40        33       11-10    8    96.06   10.05    2.25     .58


Table III gives similar data for those subjects in Grades I-IV 
inclusive of the Phillips School who took the Goodenough 
Drawing-of-a-Man Test in addition to the Kuhlmann-Anderson 
series. Averages and sigmas for both IQs are given, ‘K-A’ 
referring to Kuhlmann-Anderson and ‘G’ to Goodenough. 


TABLE III. Summarized data on subjects taking Goodenough Drawing- 
of-a-Man Test 


                   Sex              CA         IQ (K-A)         IQ (G)          Ptg.S
Grade  No.      M         F       Av.    σ     Av.      σ      Av.      σ      Av.        σ
I      84      58        26       7-0    2   101.35   11.2   104.40   10.95    2.74      .58
II     65      28        37       8-2    6    97.86    7.74   99.82   19.32  [illeg.]  [illeg.]
III    70    [illeg.]  [illeg.]   8-7    6    96.60    9.30  105.35   14.10    2.48      .42
IV     58      29        29       9-9    7    98.35   11.95  103.30   16.85    2.34    [illeg.]


B and C. Artistically superior high school subjects and adult 
artists. For the artistically superior or ‘X’ subjects selected 
from Grades VII-XII inclusive in Milwaukee, individual records 
were compiled.¹³ 

VI. Results of study. A. Normal group. To determine the 
relationship existing between results of the measures of general 
intelligence and of artistic ability for the normal group of sub- 
jects, correlation techniques were employed. Results of this 
statistical treatment of the materials are presented in the follow- 
ing series of tables. Table IV gives the Pearsonian coefficients 
of correlation between the Kuhlmann-Anderson IQ and painting 
score and, in the last column, the partial correlations between IQ 
and painting score with CA constant. Correlations between 
painting score and chronological age and between CA and IQ 
are also given to show the derivations of the partial r’s. It was 
necessary to rule out the factor of CA for a more accurate repre- 
sentation of the relationship between the two measures under 
consideration, since the subjects were grouped by grades, rather 
than by chronological age. 
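
The derivation of a partial r from the three zero-order coefficients can be illustrated with a short sketch. It assumes the standard first-order partial correlation formula, which the source does not print; the function and argument names are illustrative. The worked values are the Grade III coefficients listed in Table IV.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant.

    Assumed formula:
        r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2) * (1 - r_yz^2))
    """
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return (r_xy - r_xz * r_yz) / denominator


# Grade III, Table IV: painting score & IQ = .49,
# painting score & CA = .12, CA & IQ = -.21
print(round(partial_r(0.49, 0.12, -0.21), 2))
```

For these Grade III inputs the sketch returns about .53, the partial r entered for that grade in Table IV. Because CA correlates negatively with IQ and slightly positively with painting score, holding CA constant raises the coefficient.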

Table V gives the same data for the results of Phillips School 
taken separately and Table VI for the results using the 
Goodenough IQ. Results from Grades I and VII are not 
repeated in Table V since they appear in the preceding table. 

An examination of both the correlations between painting 
score and IQ and the partial r’s given in Table IV reveals a range 


13 These are on file as records of the Division of the Psychology of Art. The 
artists’ records were obtained under conditions of a confidential nature and are 
consequently not available. 


TABLE IV. Kuhlmann-Anderson IQ and painting score 


                                                    Ptg.S & IQ
Grade    Ptg.S & IQ    Ptg.S & CA     CA & IQ     (CA constant)
I         .16±.07       .24±.06      −.52±.05         .35
II        .21±.06       .18±.06      −.47±.05         .34
III       .49±.05       .12±.07      −.21±.06         .53
IV       −.05±.07      −.02±.07      −.46±.05        −.07
V         .17±.07       .13±.07      −.51±.05         .28
VI        .22±.06       .01±.07      −.63±.04         .29
VII       .01±.07       .10±.07      −.64±.04         .02


TABLE V. Kuhlmann-Anderson IQ and painting score, Phillips School 


                                                    Ptg.S & IQ
Grade    Ptg.S & IQ    Ptg.S & CA     CA & IQ     (CA constant)
II        .23±.07       .22±.07      −.61±.05         .47
III       .29±.07      −.06±.08      −.46±.06         .30
IV       −.07±.08      −.11±.08      −.57±.05        −.16
V         .01±.08       .25±.08      −.58±.05         .19
VI        .01±.08       .01±.08      −.59±.05         .09


TABLE VI. Goodenough IQ and painting score 


                                                        Ptg.S & IQ (G)
Grade    Ptg.S & IQ (G)    Ptg.S & CA    CA & IQ (G)    (CA constant)
I           .31±.07         .30±.07       −.22±.07           .40
II          .12±.08         .23±.08       −.42±.07           .24
III         .41±.07         .00±.08       −.12±.08           .39
IV          .18±.09        −.03±.09       −.30±.08           .18

from a slightly negative correlation of −.07 in Grade IV to a positive 
correlation of .53 in Grade III. These figures fall approximately 
within the range of slightly above 0 to .50 correlation reported by 
previous investigators dealing with ability in representative 
drawing in unselected groups. Moreover, they approach very 
closely the range in correlations of .140±.070 to .510±.056 
found by Bird (3) whose recent study corresponds most nearly 
to the present one in that a standardized intelligence test was 
employed and an approximately equal number of subjects in a 
comparable, though slightly larger, age-range was used. Prac- 
tically all of the earlier studies were limited in their measures of 
intelligence to teachers’ estimates of general ability or to standing 
in school subjects, since intelligence tests had not yet been 
developed. Also the age-range studied was narrow in most 


14 The correlations cited are those for combined score on three drawings and 
Dearborn IQ. 

cases and in some instances the number of subjects in any one 
age-group was too small for reliable results. Bird’s study dif- 
fered from the present one mainly in the measure of artistic 
performance used, which resembled that of the earlier investiga- 
tions. Bird’s subjects made three drawings of a cat running 
after a ball, the first from memory, the second from a model, and 
the third from memory after the model had been removed, these 
drawings being scored for accuracy of representation and the 
portrayal of action. 

While Bird interprets the correlations found in his study as 
showing that intelligence is one of the factors which influences 
ability to draw, he points out that the degree of relationship 
between the two factors is slight when intelligence is measured 
by the Dearborn Group Test. 

In the light of the agreement between the correlations in the 
present study and those of previous investigators, particularly of 
Bird, apparently no greater relationship to general intelligence 
exists for the type of artistic ability considered in this study than 
for ability in representative drawing, the relationship in either 
case being generally positive but low. 

Further examination of Table IV shows that with the excep- 
tion of Grade IV a low positive correlation between CA and 
painting score combined with a more marked negative correlation 
between CA and IQ effected an increase in correlation between 
painting score and IQ, when the influence of age was eliminated 
by partial correlation. However, since these changes in corre- 
lation were comparatively negligible in Grades III, IV and VII 
and not considerably larger in the remaining grades, such age- 
differences as existed within a grade seem to have had little effect. 
The most marked increase occurred in Grade I, for which the 
correlation between CA and painting score was higher than in 
any other grade and which was also one of the two grades where 
this correlation was higher than that between painting score and 
IQ. This is to be expected, since the younger children in 
Grade IB have had much less experience with the art media used 
than the older children in Grade IA; many of them have made 
their first paintings on their entrance to school.¹⁵ The same 
general tendency to increase somewhat manifests itself in the 
results of partial correlations for Phillips School taken sepa- 
rately (see Table V). Almost the same degree of increase is 
noted for Grade II as in the case of Grade I, probably due to the 
operation of similar factors. 

In addition to ruling out the factor of CA, some check on the 
factor of art training was necessary and for this purpose corre- 
lations for the Phillips School are presented separately. In spite 
of the fact that the general course of art instruction is uniform 
throughout the Des Moines Public School system, possible vari- 
ation in the training in different schools may have been operating 
to affect the correlations in Grades II-VI inclusive where the 
subjects at Phillips School were supplemented by a smaller num- 
ber of subjects from a second school. Using Phillips School 
only, the training factor, although not rigidly controlled, is more 
constant for all subjects within each grade.¹⁶ 

A comparison of Tables IV and V shows that in general the 
correlations for Phillips School agree with those for the entire 
group in being low and positive in all grades except IV, where 
the results are again slightly negative. The correlations are, 
however, somewhat lower in each instance, with the exception 
of Grade II where the product-moment correlation is higher by 
two points. Partial correlation raised the correlation in this 
grade considerably more in the Phillips School than in the entire 
group, so that the difference in the partial r’s is larger. The 
lowering of the correlations in Grades III-VI inclusive may have 
resulted in part from the reduction in the number of subjects and 
the limiting of the IQ range at the upper level with the use of 
Phillips subjects only, but an examination of the actual paintings 
indicates that certain aspects of the training in the schools from 
which the higher IQ children were drawn were of such a nature 


15 The fact that two crayon drawings in addition to two paintings were made 
by the subjects in Grade IB did not operate to lower their scores, since the 
average value assigned to two paintings approximates that of the average for 
both the crayons and paintings which was used as the score of each subject. 

16 Regular art classes taught by an art teacher are introduced at Grade IV. 
In the earlier grades the art activities are related to the teaching program and 
are directed by the regular classroom teacher. 

as to cause these children to make higher painting scores. The 
Hubbell subjects in Grades II and IV, particularly in the former, 
had apparently been trained more specifically in space-filling and 
hence made more complete and unified compositions. In the 
case of the Perkins School, Grade VI and to some extent 
Grade V, the paintings possessed certain unusual color qualities, 
the achievement of which doubtless resulted from special training 
in this respect. Only in Grade II, where such differences in 
training effects are not observable in the products of the two 
schools, was the product-moment correlation between painting 
score and general intelligence not lowered and only slightly 
altered. These Phillips correlations, together with the almost 
zero correlation in Grade VII, suggest that with a group of sub- 
jects comparable in number and IQ range to those from the 
combined schools, but with the factor of training more adequately 
controlled, lower correlations between painting score and IQ than 
those given in Table IV would result in the grades above II. 

Further examination of the partial correlations secured with 
the Phillips School data reveals that, while all of the correlations 
are low, comparatively higher correlations are found in the first 
three grades than in the upper grades, with a decided drop evident 
at Grade IV. Goodenough has pointed out that there is a close 
relationship between general intelligence and concept develop- 
ment as shown in drawing in children up to the age of about 
ten, after which there is a decrease in the relationship, drawing 
taking on the character of a specialized ability rather than a 
general means of expression. While a high relationship to 
general intelligence is not found in any instance in this study, 
which deals with a type of ability distinct from that to which 
Goodenough refers at the younger ages, the grade range in which 
the correlations are of the larger magnitude (I-III) parallels 
the ages at which Goodenough finds a high relationship to exist. 
Thus while a close relationship does not exist between ability to 
achieve esthetic quality in compositions and general intelligence, 
as is the case with drawing used as a general means of expression 
at the younger ages, there is a tendency toward some relationship 
in the lower grades in contrast to the upper grades where the 
correlations are approximately zero. Apparently even the type 
of ability measured in this study does not emerge as a special 
ability, almost entirely independent of general intelligence, until 
around the ages of nine and ten. 

When the measures of artistic ability used in this study were 
correlated with IQ’s obtained from the Drawing-of-a-Man Test 
devised by Goodenough to measure graphic concept development, 
low, positive correlations were again found in Grades I-III 
inclusive, and also a slightly positive correlation in Grade IV, 
instead of the slightly negative correlation secured with the 
Kuhlmann-Anderson IQ in this grade. Since these correlations 
on the whole do not vary in any considerable degree from the 
correlations between the Kuhlmann-Anderson IQ and painting 
score, it appears that the Goodenough Test, a performance test 
measuring the developmental characteristics in children’s draw- 
ings of a man, does not relate any more highly to the child’s 
ability to achieve esthetic quality in composition than does the 
usual verbal type of intelligence test such as the Kuhlmann- 
Anderson series. 

Bird (3) reports a correlation of .49±.03 between combined 
drawing score and IQ on the Goodenough Drawing-of-a-Man 
Test for a group of 248 subjects, the majority of whom were 
between 6-6 and 9-6 years, an age-range comparable to that of 
the subjects in this study. The higher correlation obtained by 
Bird is probably due to the fact that the drawing ability which 
he measured is more closely related to the aspects of the draw- 
ings of a man measured by Goodenough than is the measure of 
esthetic quality in paintings used in the present study. Still 
higher correlations ranging from .63 to .86 were obtained by 
Manuel and Hughes (27) who correlated IQ’s on the Goodenough 
with ratings of the test responses for their quality as drawings, 
the records of 836 children in Grades I-IV inclusive being used. 
Furthermore, the correlations in the first and second grades were 
higher than in the third and fourth grades. The authors suggest 
that probably the factors which differentiate talent in drawing 
are less obvious in the first grade than they are in later grades. 
While the Thorndike Scale used as the basis for ratings of the 
drawings is limited in its adequacy for this purpose, it is probably 
true that there is little to differentiate performance as measured 
by Goodenough and the artistic quality shown in a single drawing 
of a man, particularly in the earlier grades. Use of the phrase 
‘talent in drawing,’ however, in this instance is rather misleading 
because of the decided limitation in the measure of this factor. 
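
The ± figure attached to each correlation in this section is not defined in the text. A minimal sketch follows, assuming it is the probable error of r as conventionally computed at the time, PE = .6745(1 - r²)/√N; both the formula and the function name are assumptions introduced here, not statements from the source. The worked case is Bird's coefficient of .49 for 248 subjects.

```python
import math


def probable_error_of_r(r, n):
    """Probable error of a product-moment correlation.

    Assumed formula: PE = 0.6745 * (1 - r^2) / sqrt(n).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return 0.6745 * (1 - r * r) / math.sqrt(n)


# Bird's reported coefficient: r = .49 with N = 248
print(round(probable_error_of_r(0.49, 248), 2))
```

Under this assumption the value for r = .49 and N = 248 rounds to .03, matching the reported .49±.03, which lends some support to reading the ± figures as probable errors.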

On the whole in the earlier grades, drawings of a man offer 
very little which is measurable as artistic quality apart from 
certain aspects of the drawings taken into account by the 
Goodenough Test. The Kline-Carey Scale for a man run- 
ning (20), for example, aside from the factor of the portrayal 
of action, lays stress on the inclusion of the different parts of the 
body, the correct attachment of arms and legs, the proportions 
achieved, etc., although, of course, not going into such detail nor 
dealing with developmental sequences as Goodenough does. 
Additional features, such as light and shade, do not appear until 
the later samples of the scale, representing work in the upper 
grades. McCarty’s Scale (25) for drawings of human beings, 
devised for ages 4-0 to 8-6 yrs., is still more closely related to 
the Goodenough Drawing-of-a-Man Test in the type of ability 
measured, dealing as it does with ability to express ideas vividly, 
clearly and accurately rather than with artistic merit. The low 
relationship between the Goodenough IQ and painting score in 
this study should be expected, however, since emphasis has been 
placed on the achievement of certain esthetic qualities in paint- 
ings, rather than on realistic representation, where certain aspects 
of measurement in common with the Goodenough may be present 
to increase the degree of correlation. 

Correlations between the Goodenough and Kuhlmann-Ander- 
son IQ’s were as follows: Grade I, .43±.06; Grade II, .42±.07; 
Grade III, .42±.07, and Grade IV, .19±.07. In view of the 
.74±.02 correlation reported by Goodenough between IQ on the 
Drawing-of-a-Man Test and Stanford-Binet IQ, higher corre- 
lations would be expected and an explanation of the discrepancy 
is not apparent. These correlations for Grades I-III inclusive 
agree closely, however, with the r of .47±.03 reported by 
Bird (3) for the Dearborn Group Tests of Intelligence. They 
also agree with Goodenough’s correlations with teachers’ judg- 
ments of ability, which were found to be .44 within the first 
three grades, with a drop to lower correlations thereafter. 

VII. Results of study. B. High school subjects. For the 
group of artistically superior high school subjects used in the 
second phase of the study, intelligence test results give an aver- 
age IQ on the Kuhlmann-Anderson series of 109.35 in the case 
of the 24 starred or unusual subjects and of 107.30 for the 
remaining 48 subjects. These data show that the artistically 
superior also tend to be somewhat superior in intelligence, the 
group average falling at the upper limit of average IQ’s, accord- 
ing to the classifications given by Terman (36). Rank in intelli- 
gence apparently gives little indication of the degree of artistic 
superiority, however, since the 24 starred subjects have only a 
slightly higher average IQ than the remainder of the group. 
Within the group of starred subjects, also, the subject indicated 
as the most outstanding had an IQ of only 101 on the Kuhl- 
mann-Anderson series and of 105 on the Stanford series. On 
the basis of their IQ’s one of the group of 24 outstanding 
subjects would be included in the Terman classification of dull; 
ten, average; eight, superior, and five, very superior. While 
there is a slight preponderance of these subjects in the superior 
and very superior groups it is obvious that an average IQ does 
not preclude the possibility of successful artistic performance at 
the high school level. 

These findings are at some variance with the results reported 
by Terman in his investigations of gifted children. Terman 
attempted to locate children of only average or moderately 
superior IQ who were outstanding in certain special abilities, 
including art, but in a school population of a quarter-million 
only 26 children (15 in art) were found who showed any 
unusual promise. A ‘follow-up’ of these subjects showed that 
the early promise was not fulfilled in a single case. Terman, 
therefore, concludes that “without superior general intelligence, 
special ability in music and art inevitably falls short of really 
great achievement. All of the young musicians and artists of 
genuine accomplishment whom the writer has studied,” he adds, 
“have had without exception, high intelligence quotients.” 

In considering Terman’s findings it should be pointed out that 
the degree of artistic ability of the children originally selected 
for the special ability group is somewhat in question. Although 
great effort was made to locate these special cases, with the final 
selection made from a special art class of supposedly talented 
children, satisfactory measuring methods were not available and 
Terman states that “with two or three possible exceptions there 
was little evidence of anything more than ‘copying ability’” (37, 
p. 34). On the face of this evidence there is room to doubt 
that the special ability group selected would be expected to 
achieve any marked success. The fact that a larger number of 
children with creative ability and with an IQ below the very 
superior level could not be located is difficult to explain except 
perhaps on the ground that adequate means for the selection 
of such subjects were not available. It is also possible, as Ter- 
man points out, that such subjects could have been more easily 
identified at a somewhat higher age-level than that which was 
used. The finding that certain of the intellectually gifted were 
markedly successful in art, among other things, would be 
expected, since there are without doubt those children in whom 
high intelligence, combined with artistic ability, contributes 
toward outstanding achievement in this specialized activity. As 
has been pointed out, instances of artistic superiority combined 
with high intelligence, although not as high as that possessed 
by Terman’s intellectually gifted, are also found in this study. 
The present study also shows, however, that the artistically 
superior need not necessarily be intellectually superior. 

Kerschensteiner and Kik, who report findings similar to those 
of Terman, seem to have located children demonstrating definite 
creative, rather than copyist, ability in art. Their conclusion, 
that great talent for graphical expression is regularly connected 
with good intellectual endowment, cannot be adequately inter- 
preted, however, for purposes of comparison with the results of 
studies using intelligence tests, since intellectual endowment was 
indicated to them solely by the child’s status in general school 
work. 

While results for the artistically superior subjects in this 
investigation differ from those of Terman, and perhaps of 
Kerschensteiner and Kik, this disagreement is probably due to 
the limitations pointed out in these earlier studies. In the pres- 
ent investigation findings are based on measurements with a 
standardized intelligence test of a large number of subjects 
selected for their artistic superiority in creative, rather than 
copyist, activities. It is fairly improbable that the average IQ 
found was unduly weighted by a selection of artistically superior 
subjects from schools of predominantly lower or of predomi- 
nantly higher than average IQ level, since practically all junior 
and senior high schools in the Milwaukee system were used. 
Hence, the population from which the subjects were drawn would 
be expected to show a normal distribution of intelligence. 
Furthermore, the fact that the subjects were in the junior and 
senior high school grades would give increased surety of their 
artistic superiority, since there had been a longer period for 
observation of their work than in the case of younger pupils, 
and greater opportunity for proof of their consistency in artistic 
performance. 


The results of this phase of the study dealing with the artisti- 
cally superior indicate the presence of a somewhat higher rela- 
tionship with general intelligence for the upper extreme of the 
distribution of artistic ability than for the normal group, if 
correlations similar to those found for the latter in the upper 
elementary school grades hold also for the high school level at 
which the talented subjects were selected. While no data are 
available in the study to explain this difference, it is likely that 
a certain degree of general intelligence is required to profit from 
the training necessary for the type of achievement demonstrated 
by the artistically superior at these grade-levels, with the result 
that average and above average IQ’s are found in the majority 
of cases in the selected group. The results of Manuel’s study of 
19 artistically talented subjects, ranging from fifth grade to 
university students, furnish some basis for this inference in 
regard to the relation between intelligence and development 
in drawing, although his evidence bearing directly on this point 
seems to be limited to the finding that one subject of inferior 
intelligence seemed unable to profit rapidly by advanced instruc- 
tion in technique and that another subject of markedly inferior 
intelligence showed less originality than ability in graphic 
representation. 

VIII. Results of study. C. Adult artists. With the exten- 
sion of the second phase of the study to the highly selected group 
represented by the adult artists, results again show a higher 
than average degree of intelligence to be co-existent with artistic 
superiority, the superiority in this instance representing the 
extreme of genuine accomplishment among those following art 
careers. The average IQ of the group of 51 artists as measured 
by the Otis Self-Administering Tests of Mental Ability was 
118.43, which approaches the 120 IQ set by Terman (36) as 
marking the division between superior and very superior intelli- 
gence. Since this average is nine points higher than that for 
the 24 junior and senior high school subjects showing outstand- 
ing ability in art, indications are that a still higher level of gen- 
eral intelligence is necessary for eventual marked achievement 
in art. As with the younger group of selected subjects, how- 
ever, a wide range in IQ within the group is found, there being 
nine artists of average intelligence, 12 of superior, 26 of very 
superior, and one of ‘near’ genius or genius intelligence 
(according to Terman’s classification). With the adult artists 
further study of the nature of the relationship between artistic 
achievement and status in general intelligence was possible 
through experts’ ranking of the artists in the order of their 
importance. The correlation between rank in art assigned by Judge 
1 and rank in intelligence, using the rank-difference method (10, 
p. 190), was −.12 and that for rank in achievement by Judge 2, 
.10, the equivalent product-moment r’s being −.13±.10 and 
.12±.12, respectively. These correlations indicate that within 
the group there is no relationship between the individual’s rank 
as an artist¹⁷ and his rank in intelligence. Thus it would seem 
that while a high degree of intelligence characterizes the suc- 
cessful artist group as a whole, intelligence level within the group 
may give little or no indication of the individual’s importance 


17 A striking observation in this connection is that about ten names appeared 
consistently in the upper ranks of all four lists. 

as an artist. Since many factors related to general intelligence 
doubtless operate in the attainment of eminence in art, as in 
other fields, it is not surprising to find a superior IQ level for 
the group. The ‘special ability’ character of the art talent 
factor, however, is supported by the absence of relationship 
between rank in importance as an artist and IQ. 

The possible effect of differences in chronological age on these 
results was checked through correlation with rankings of the 
artists and with IQ. The rank-difference correlation between 
chronological age and rankings by Judge 1 was −.37 and for 
Judge 2 was −.17, the equivalent r’s being −.39±.09 and 
−.18±.13, respectively. These correlations show a tendency 
for the judges to give higher rankings to the younger artists, 
indicating perhaps that of the more important artists greater 
co-operation was secured in this study from the younger than 
the older group. A −.14±.09 correlation between chronological 
age and IQ also indicated a tendency, though slight, for the 
younger artists of the group to score higher in intelligence. The 
simultaneous operation of the tendencies for the younger artists 
of the group to score higher in intelligence and to be ranked 
higher in achievement would have tended to raise the correla- 
tion found between rank as an artist and IQ. This would not 
alter the interpretations of the relationship between rank as an 
artist and status in general intelligence already given, however, 
since indications are that the low correlations of −.13 and .12 
are even somewhat higher than would have resulted if the factor 
of chronological age had been eliminated. 

The influence of certain other factors on the results cannot 
be checked because of the difficulties inherent in the manner of 
conducting investigations of this type in which direct testing of 
the entire group chosen for study is impossible. The extent 
to which the IQ distribution, found with the group of 51 artists, 
represents that of the larger group solicited cannot be known 
since some degree of selection relative to the factor of intelli- 
gence may have been operating to determine the group par- 
ticipating. Furthermore, since the intelligence test was in each 
instance entirely self-administered it is probable that the condi- 
tions under which the test was taken were not equally favorable 
or well controlled for those who responded, as would be more 
nearly the case if a direct administration of the test had been 
possible. 

Possibilities for a more complete interpretation of the intelli- 
gence test results of the adult artists are also limited by the fact 
that similar data from groups of outstanding men in other pro- 
fessions were not secured for purposes of comparison. Holling- 
worth (14, p. 152) has expressed the opinion that “probably 
those who achieve eminence in the arts are, on the whole, as 
highly endowed with general intelligence as are those who win 
eminence in other kinds of careers.” On the basis of the results 
for the artists in this study, however, it appears that the average 
of intelligence found in this group is lower than that which 
would be expected for similar groups in certain other profes- 
sions, at least in those in which achievement is more obviously 
dependent on the types of mental activity measured by intelli- 
gence tests. Little direct evidence in connection with this 
problem is available in the literature. Cox in a study of the 
early mental traits of 300 geniuses found the rating in intel- 
lectual traits for a group of 13 artists to be significantly lower 
than that for eminent men in general. Since this study was 
made in retrospect, however, these findings are based on assumed 
IQ’s and are hardly comparable to results in the present study. 
While the classification, artist, does not appear in the familiar 
occupational intelligence ratings from the Army Alpha Test 
results (30, p. 829), one group of 28 artists appears in an earlier 
table in such a rank as would indicate an intelligence falling 
at the lower limit of the ‘B’ or superior group, below the 
occupations identified as engineer officer, medical officer, civil 
engineer, and a number of other professions. Since this investi- 
gation was quite limited in many respects and dealt with com- 
parative status among occupational groups without regard for 
rank within the group, it gives only an indication by inference 
that artists of marked achievement would be somewhat lower 
than men of similar achievement in the careers mentioned. 

For the field of music, experimental data on the relationship 
between intelligence and achievement have been contributed by
the extensive research conducted at the Eastman School of 
Music (35). While the problem was approached from a dif- 
ferent angle than that of the present study, these results in music 
offer possible inferences for the situation in art, since both are
regarded as special abilities. After considerable experimentation 
with prognosis of musical achievement at the Eastman School, 
it was found best to combine the results of the Seashore Measures
of Musical Talent with the Iowa Comprehension Test (a
college ‘intelligence’ test). In the cumulative key developed
for this purpose, a series of score combinations are designated 
for each of five categories of prediction, ranging from ‘discouraged’
to ‘safe.’ These show for the prediction of ‘safe,’
intelligence ratings of superior, very good and high average
combined with a superior talent profile. With the artist group
in this study IQ’s within approximately the same range were
found to be present in the case of presumably superior artistic 
talent, although there were no actual measures of this factor. 
The fact that intelligence conditioned the use of musical poten- 
tialities to such an extent that a measure of this factor was 
included for prediction of success in the music school courses 
lends confirmation to the explanation advanced for the superior 
IQ average of the artists and the high average IQ of the selected 
high school group, since intelligence probably functions in a 
similar manner in achievement in art.

Analysis of intelligence test responses. In an attempt to dis- 
cover whether certain types of tests included in the different 
intelligence tests were more highly related to artistic ability than 
others, analyses were made of each, treating scores on the component
tests or items separately. In the case of the Kuhlmann-Anderson
series, as has been pointed out, comparisons may be
made between standing on the different tests in the battery since 
a mental age is assigned to each. The number of above and
below median mental age responses made on each test by a
group of subjects whose painting scores were above one sigma
of the distribution in Grades III and IV of the Des Moines
group,18 was therefore recorded in order to determine whether
subjects of higher rank than the average in artistic production 
tend to score above or below their median mental age more 
frequently on certain tests than on others. On examination of 
the mental ages on the separate tests taken by the subjects in 
each of these grades, however, it became apparent that there 
was a general tendency for the group as a whole to score higher 
on certain tests and lower on others. Similar records for a 
group of subjects whose paintings were below one sigma of 
the distribution and for the larger group of subjects falling 
between ±1 sigma, were therefore compiled. With these data
for comparison, results for the artistically superior group did
not vary to any considerable degree from those for the other 
two groups, the differences in any case being slight and probably 
insignificant. With the type of tests included in the Kuhlmann- 
Anderson series it appears that relationship to artistic ability is 
neither more nor less marked in the performance on one test of 
the series than on another. 
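
The three-way grouping used in this comparison (above one sigma, below one sigma, and between ±1 sigma of the painting-score distribution) can be sketched as follows. This is only an illustration: the scores are invented, the function name `sigma_groups` is hypothetical, and the use of the population standard deviation is an assumption, since the text does not state how sigma was computed.

```python
from statistics import mean, pstdev

def sigma_groups(scores):
    """Split subjects into three groups by distance from the mean score.

    `scores` maps a subject label to a painting score. The population
    standard deviation is used here as an assumption; the study does not
    say which form of sigma was computed.
    """
    values = list(scores.values())
    centre = mean(values)
    sigma = pstdev(values)
    groups = {"above": [], "within": [], "below": []}
    for subject, score in scores.items():
        if score > centre + sigma:
            groups["above"].append(subject)    # artistically superior group
        elif score < centre - sigma:
            groups["below"].append(subject)    # artistically inferior group
        else:
            groups["within"].append(subject)   # larger middle group
    return groups

# Invented scores for illustration only; these are not data from the study.
hypothetical_scores = {"a": 2.0, "b": 4.0, "c": 5.0, "d": 5.0, "e": 6.0, "f": 8.0}
groups = sigma_groups(hypothetical_scores)
```

With the groups formed in this way, the count of above-median and below-median mental age responses on each test can be tallied separately for each group and the three tallies compared.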

For the group of 23 artistically superior subjects in the Mil- 
waukee Schools, to whom Stanford-Binet Intelligence Tests 
were given, results were recorded by separate tests for purposes 
of analysis. Of the responses of the 22 subjects who had a 
basal age below year 16, the number of ‘passes’ on the individual
tests of this age series was as follows: vocabulary, 6;
interpretation of fables, 14; differences between abstract terms, 
10; problem of enclosed boxes, 12; repeating six digits reversed, 
12, and using a code, 16. For the 20 subjects passing one or 
more tests at year 18, there was the following number of passes: 
vocabulary, 3; Binet’s paper-cutting, 14; repeating eight digits, 
8; repeating the thought of a passage, 13; repeating seven digits 
reversed, 8, and ingenuity, 7. The largest number of ‘passes’
in year 16, as will be noted, was on the ‘using a code’ test 
and in year 18, on Binet’s paper-cutting test. For the latter 
test, Terman states that “success seems to depend upon constructive

18 A similar procedure was not followed in Grades I and II because of the
presence of a number of maximum and minimum scores and was not carried
beyond Grades III and IV because results did not seem to warrant its
continuance.

visual imagination” (36, p. 339). In the case of the
former, while he points out that “the significance of the test 
must be expressed in other terms than the kind of imagery it 
may happen to bring into play,” he states also that “nearly
all subjects find it necessary to trace the code form in imagi- 
nation, from the beginning up to each letter whose code form 
is sought” (36, p. 331). Thus these two tests, particularly 
the Binet paper-cutting test, appear to tap certain functions 
which would be expected to be more important in artistic production.
It should be pointed out, however, that there were
almost as many ‘passes’ on interpretation of fables in year 16
and repeating the thought of the passage in year 18, in which 
the mental abilities most involved would seem to have little 
direct implication for functioning in art. In the light of the 
oft-expressed opinion that the artistic individual is visually 
rather than verbally minded, it is interesting to note that the 
vocabulary test was more often failed at both levels than any 
other test. 

This analysis of test responses is of course limited by the 
small number of subjects under consideration and by the fact 
that similar analysis of the responses of an unselected group
has not been made. Skalet (34) has made such a study but
gives only a tentative interpretation of results above the nine 
year group because of the limited number of cases used above 
this age level. Also, it is difficult to analyze the mental func- 
tions operating in performance on any particular test, and no 
extended measure of any function is given in a single sub-test,
as such was not intended in the development of the general 
intelligence test. Tests of special aptitudes, such as those recently 
developed by Thurstone, would have been more valuable for 
analytical purposes, but these are devised for the college level 
and are also quite time-consuming in their administration. 

For the adult artists, responses to each of the 75 items of 
the Otis Test were recorded separately and the number of failures 
on each item noted. Although Otis states in the manual of test 
instructions (31) that the items are arranged in order of diffi- 
culty, no data are given as to the exact difference in difficulty 
between items. No weighting of the items to equate this factor
is therefore possible for purposes of comparison. The grouping 
together of certain items of varying degrees of difficulty but 
dealing with similar materials, however, shows that a greater 
number of errors occurred in those items involving following 
directions and number concepts, particularly the latter. Items 
which have been classed as following directions deal mainly with 
locating the correct letter in a series by taking certain steps given 
in the directions, and those identified as dealing with number 
concepts involve the finding of the wrong number in a series 
of numbers. The greatest number of errors for a single item 
occurred on number 68 which deals with line and space concepts. 
This may be explained, however, by the fact that this problem
is more one of geometrical (mathematical and abstract) than one
of visual space relations. This item, as well as items 66 and
67 on which there was a large proportion of errors, would be 
included with number concepts in the larger classification of 
mathematical abilities. While it thus appears that a majority 
of the items presenting difficulty for the artists involve mathe- 
matical abilities, items in a third group in the same classification, 
arithmetical reasoning, are missed no more often than others 
of approximately the same difficulty. 

Of the more difficult items, the smallest number of errors 
was made on items 71 and 69, which fall in the large classifica- 
tions of verbal and of reasoning abilities, respectively. No tend- 
ency to fall into more or less specific groups is apparent, however, 
in the case of the items showing the greater number of correct 
responses, as was true for the items most frequently missed. 
It appears that the test does not include items particularly diag- 
nostic of the types of abilities more peculiar to the artist’s 
equipment. 

IX. Summary and conclusions. This study was devised to 
determine the relationship of general intelligence to artistic 
ability, both in the degree to which it is present in the child 
population and as it is manifest in superior performance at the 
high school level and in genuine achievement in the artist. The 
entire study was planned in the light of previous findings which 
had indicated the following considerations to be of major importance,
namely, age of the subjects, type of artistic ability
measured, and nature of the group studied, whether unselected 
or composed of the artistically talented. The problem was there- 
fore approached with the view of determining the effect of these 
various factors, by considering both selected and unselected 
groups within a wide age-range, measured for a similar type of 
artistic ability. Data on representative drawing collected by 
previous investigators for unselected groups served for purposes 
of comparison in regard to the latter factor. 

For the first phase of the study each of the school grades, from
I to VII inclusive, was represented by a sampling of 100 sub- 
jects, normally distributed as to intelligence, with an IQ average 
approximating 100. The artistic ability of these subjects was
determined by their average scores on three paintings, judged
for æsthetic quality by means of scales devised for this purpose.

For the first of the selected groups included in the second 
phase of the study, a canvass of the junior and senior high school 
population in a large mid-western city resulted in the rating of 
72 students as artistically superior by the art supervisors and 
teachers, the former designating 24 of this number as outstand- 
ing. The second group, which represents a high degree of 
selection at the adult level, was composed of 51 artists of recog- 
nized achievement in their profession. Two expert judges 
ranked the artists in order of importance. 

Measurement of the factor of general intelligence was based 
in each instance on responses to a standardized intelligence test, 
the Kuhlmann-Anderson in the case of the normal and selected 
high school groups and the Otis Self-Administering Tests of 
Mental Ability in the artist group. The Goodenough Drawing- 
of-a-Man Test was also used for certain subjects in the 
first group. 

A. For the normal group (grade school subjects). On the basis 
of correlations between the measures of intelligence and of 
artistic ability, the following conclusions are drawn: 

(1) Only a slight relationship is found between general 
intelligence and artistic ability in the grades above III, where 
the correlations are low and unreliable in each instance, being
slightly negative in Grade IV. For the lower grades the rela- 
tionship is significant for Grade III, and partial correlations, 
holding CA constant, show an increase in relationship in Grades 
I and II, indicating a positive but only a fair relationship between 
general intelligence and artistic ability at these years (ages 
6 to 8). 

(2) The relation of general intelligence to artistic ability, 
functioning here in free or creative expression, appears to be 
virtually the same as that found by other investigators for ability 
in representative drawing. 

(3) A high relationship to general intelligence is not found 
in Grades I to III inclusive which parallel the ages in which 
development in drawing, regarded as a general means for expres- 
sion of emerging concepts, has been shown to be highly related 
to general intelligence as contrasted to the later ages in which
the importance of this factor decreased with a change in the 
nature of drawing. The present results agree, however, to 
the extent that a low to fair degree of positive relationship is
manifest in these earlier grades. 

(4) For the Goodenough Drawing-of-a-Man Test a low 
degree of relationship to artistic ability exists as indicated by 
low, positive correlations in Grades I to IV inclusive. Since
this finding is in general agreement with that for the Kuhlmann- 
Anderson test in these grades, particularly I to III inclusive, 
it appears that for the type of artistic ability considered in this 
study there is no greater relationship to an intelligence test based 
on graphic conceptualizing (the Goodenough) than to the usual 
verbal type of intelligence tests. The fact that a higher degree 
of relationship has been reported by other investigators in the 
case of representative drawing may be explained by the presence 
of certain more or less common aspects in the measurement of 
object drawing for accuracy of representation and for the 
developmental characteristics considered by Goodenough. 

B. For the artistically superior high school subjects and 
adult artists. 


(1) There is a tendency for artistically superior subjects at 
the junior and senior high school level to be somewhat superior
in intelligence, although not to the degree suggested by Terman. 
The average IQ for the group is near the dividing line between 
‘high average’ and ‘superior’ in the classification as given by 
Terman. This position on intelligence is not commensurate with 
the subjects’ manifest superiority in artistic ability. 

(2) This same tendency is apparent in the adult artist group 
although more markedly than in the case of the high school 
subjects, the average IQ for the artists falling near the line 
separating ‘superior’ from ‘very superior.’

(3) While the IQ average of the artist group is high, it does 
not necessarily follow that there is a direct relationship between 
intellectual status and importance as an artist. The rankings of 
these artists by two experts, furthermore, suggest that such a
relationship is doubtful. 

C. General summary. In considering the results of both
phases of the study the following summary of conclusions may 
be made: 

(1) Artistic ability is a special ability in the sense of being 
only somewhat related to general intelligence as measured by 
established tests. This conclusion, however, applies primarily 
to the normal group. 

(2) There is a tendency for a higher than average degree of 
intelligence to be present with artistic superiority as indicated 
by the average of IQ’s found in the case of the selected groups. 
It appears that the use of artistic potentialities in the achieve- 
ment of marked success in art is dependent to a certain extent 
on intellectual capacity. While no data having direct bearing 
on this point were obtained in the present study, results from 
other investigations, particularly in the field of music, lend con- 
firmation to such an interpretation of the function of intelligence 
in artistic achievement. 
THE STABILITY OF ARTISTIC APTITUDE AT THE
CHILDHOOD LEVEL 1


by 


AULUS WARD SAUNDERS


I. Introduction. In the realm of special ability there is a 
widespread belief that artistic ability is inborn and remains more 
or less fixed throughout life. Among many art teachers, on 
the other hand, the belief is likewise widely held that anyone 
can learn at least the skills utilized in drawing and painting. 
The objective of this study was to test the possibility of art 
ability 2 at the childhood level being changed, with an attempt
to identify the factors that might be involved if such a change
were brought about. In other terms, the problem was as stated
by Kinter (27), “Is artistic aptitude relatively fixed and stable,
or can it be altered by intensive training?”

In general the prevailing attitude in the field of art psychology 
has been that an individual is born with a definite capacity for 
artistic appreciation and production and that this capacity is 
bounded by limits, some point within which he will succeed in 
attaining, depending on the nature of his environment and his 
endeavor. This investigation has been directed toward the nature 
of these capacity-limits, in an attempt to discover whether it is 
possible to produce a definite extension of the usual limits. This 
plan was specifically attempted, first, through the use of a number 

1 An investigation in the research program, Genetic Studies in Artistic
Capacity, under the direction of Dr. Norman C. Meier and made possible by
financial aid from the Carnegie Corporation of New York through the Carnegie
Foundation for the Advancement of Teaching.

2 Such terms as ability, capacity, aptitude, and the like are used herein as
defined in English’s “A Student’s Dictionary of Psychological Terms” (17).
Ability, for instance, is there defined as “actual power, whether inherited or
acquired, to perform any act. Capacity, while nearly synonymous, usually
means undeveloped power, and is largely innate; under training it becomes
ability.” For definition of such art terms as balance, rhythm, harmony, etc.,
see “Report of the Committee on Terminology” (61).
of subjects who, at the start of the investigation, were definitely
inferior, and a few who were definitely superior in art ability, 
and secondly, through the examination of a section of the 
general population at the same age-level. 

The field of this problem was restricted to school children 
within the age-range of five to nine years. Children of this
age have been found by Tiebout (57), Grippen (22), Whorley (62),
and others to be old enough to function artistically,
and at the same time the responses of subjects of this age 
seem to be more naive and less conditioned by learned responses 
in the field of art.

II. Procedure and subjects. The approach to the problem 
was made in two ways: (1) through the stimulation and meas- 
urement of a picked group of artistic and inartistic children 
(hereafter called “Group A”), and (2) through the study of
an adequate sampling of the general population at the same
age-levels (called “Group B”).

The plan, as actually followed, was to test the possibility of 
change in art ability by selecting a number of otherwise normal 
children who were definitely inferior in art ability, and exposing 
them to a two-year period of planned art stimulation to test 
the possibility of a resultant change in the level of art ability 
and, in case of change, to identify the factors involved. The 
stimulation was to include instruction, motivation, and techniques
of any reasonable nature designed to influence and
stimulate the subjects artistically. Measurement of art status 
of each subject was to be made at the beginning, mid-point, 
and end of the period of stimulation by the use of objective 
methods of evaluating the original, graphic products of the sub- 
jects produced under controlled conditions. Added to this list 
of inferior subjects was a small number of artistically superior 
subjects to be included in the group during the second year of 
stimulation. These were studied to see the comparative amount 
of change which might result. The two types of subjects were 
to constitute Group A. 

Comparison of Group A with a section of the general population,
Group B, was to be made in which measures of the performance
of Group B were to be obtained in various controlled
situations. This would produce standards for the measurement 
of Group A performances. Group B was to be used, further- 
more, in an analysis of various factors differentiating the artistic 
from the inartistic, thus procuring a knowledge of the factors 
to be used, in turn, in the stimulation of Group A. 

Group A consisted of 14 children, who were at the start of 
this program (Nov., 1932), enrolled in grades Junior Primary 
to 3 in the University Elementary School in Iowa City. Data
describing these subjects are presented in Table I. Of this
group, half were male and half were female. All 14 subjects
were normal, bright (with one exception) children from normal
homes in or near Iowa City. The average of intelligence of
the group was superior, the A.M. of the IQ’s being 121. The
only child below normal was a boy with an IQ of 88, who was
repeating the second grade. The intelligence quotients were 
obtained from Stanford-Binet tests given on dates near the 
beginning of the program of stimulation. 

Of the 14 subjects, 10 were selected because of their demon- 
strated inferiority in art ability, and will be called ‘Z’ subjects. 
The remaining four were chosen because of their demonstrated 
superiority in art, and will be known as ‘X’ subjects.


TABLE I. Sex, intelligence quotient, chronological and mental ages,
estimated art status of group A subjects (Nov., 1932)


        School                                  Art
Subj.   Grade    Sex    CA      IQ     MA       Status

  1     3        M      8-6     115    9-9      Z
  2     3        F      8-10    101    8-11     Z
  3     3        M      7-4     114    8-4      Z
  4     2        F      7-8     116    8-11     X
  5     2        M      6-10    133    9-1      Z
  6     2        M      6-11    140    9-8      X
  7     2        M      8-4      88    7-4      Z
  8     2        F      7-7     120    9-1      X
  9     2        M      7-7     111    8-5      Z
 10     1        F      6-2     149    9-2      X
 11     1        F      6-3     122    7-8      Z
 12     1        F      5-11    129    7-8      Z
 13     1        M      6-2     131    8-1      Z
 14     J.P.     F      5-2     123    6-4      Z

A.M.=                   7-1     121    8-7
TABLE II. Z and X subjects of group A compared


                  Sex       Av.     Av.       Av.
Subj.    No.    M    F      CA      IQ        MA
Z        10     6    4      7-1     116.70    8-3
X         4     1    3      7-1     131.25    9-3
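
The sex counts, average chronological ages, and average IQ's of Table II follow by simple arithmetic from the rows of Table I. A minimal sketch of that check is given below. It assumes that the unnumbered second row of Table I is subject 2 and that ages written "years-months" are averaged in months and rounded to the nearest month. The helper names `to_months`, `to_age`, and `summarize` are hypothetical.

```python
# Rows transcribed from Table I: (subject, sex, CA, IQ, art status).
# Assumption: the row with no subject number in the scan is subject 2.
TABLE_I = [
    (1, "M", "8-6", 115, "Z"),
    (2, "F", "8-10", 101, "Z"),
    (3, "M", "7-4", 114, "Z"),
    (4, "F", "7-8", 116, "X"),
    (5, "M", "6-10", 133, "Z"),
    (6, "M", "6-11", 140, "X"),
    (7, "M", "8-4", 88, "Z"),
    (8, "F", "7-7", 120, "X"),
    (9, "M", "7-7", 111, "Z"),
    (10, "F", "6-2", 149, "X"),
    (11, "F", "6-3", 122, "Z"),
    (12, "F", "5-11", 129, "Z"),
    (13, "M", "6-2", 131, "Z"),
    (14, "F", "5-2", 123, "Z"),
]

def to_months(age):
    """Convert an age written 'years-months' into months."""
    years, months = age.split("-")
    return int(years) * 12 + int(months)

def to_age(months):
    """Convert months back to 'years-months', rounded to the nearest month."""
    whole = round(months)
    return f"{whole // 12}-{whole % 12}"

def summarize(status):
    """Counts and averages for one art-status group ('Z' or 'X')."""
    rows = [row for row in TABLE_I if row[4] == status]
    n = len(rows)
    return {
        "n": n,
        "male": sum(1 for row in rows if row[1] == "M"),
        "female": sum(1 for row in rows if row[1] == "F"),
        "mean_ca": to_age(sum(to_months(row[2]) for row in rows) / n),
        "mean_iq": sum(row[3] for row in rows) / n,
    }

z_summary = summarize("Z")
x_summary = summarize("X")
```

Under these assumptions the sketch reproduces the Table II entries for number of subjects, sex, average CA, and average IQ in both groups.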


The method of selection of the subjects was as follows: 
subjects 3, 6, 8, 9, 10, and 13 were used previously as X and Z 
subjects by Rodgers, Grippen, Tiebout and Dow. These six 
formed part of a larger list of X and Z subjects studied by 
these investigators, who used a series of ratings by pre-school
teachers, research assistants, and art teachers; evaluations by 
three members of the art education staff; and performance on 
the Kline-Carey Drawing Scale (29), as methods of establishing
the artistic inferiority and superiority of these children. A
more detailed treatment of the method of selection is given
in Rodgers (51) and Grippen (22). The remaining subjects,
1, 2, 5, 7, 11, 12, and 14 were selected on the recommendation 
of the art teachers and the art supervisor of the elementary 
school, in addition to the previous ratings of these subjects 
made by the division of art psychology. Since this evidence 
was not considered sufficient for the use of these children as 
subjects in this investigation, an additional check on all ‘ can- 
didates ’ was made by the investigator in the form of a pro- 
cedure of ranking of paintings made by the candidates. In 
this procedure, four additional children were used, making a 
total of 18 candidates considered. Three experienced super- 
visors of public-school art, each in a different city, were used 
as judges who were asked to rank the paintings (free expression, 
on standard-size paper) of the candidates in order of merit, 
and to grade the paintings individually as to ability shown, con- 
sidering the age and grade in school of the child. Only those 
candidates whose paintings, by a close agreement of the judges, 
qualified them as distinctly superior or distinctly inferior in art 
ability were retained for use as subjects. By this process two 
of the candidates were eliminated. As two more were withdrawn
from use by the school authorities for other purposes,
a total of 14 qualified subjects remained.3

III. The program of stimulation. The program of stimulation
of Group A covered, roughly, two academic years of time,
starting in November, 1932, and extending through May, 1934. 
The first-year period extended through May, 1933; the second- 
year period began in November, 1933. The Z subjects of 
group A, 10 in number, were used during the entire period; 
the X subjects were used for stimulation only during the latter 
year. Loss of subjects due to death, illness, or removal from 
the school system reduced the total number materially during the 
period. The time of group membership (not including group 
absences for short periods of time) for the 14 subjects is 
indicated in Table IV. (Page 143.) 

Thus seven Z subjects were stimulated during both year 
periods, three Zs during the first-year period only, two X sub- 
jects through the second-year period, and two Xs during part of 
the second-year period. For purposes of measurement, all but 
two subjects (1 and 12) were available throughout the entire 
period. 

Specifically the program of stimulation consisted of a two- 
year series of meetings of the group with the investigator, in 
an art-class situation in which the subjects were instructed, 
reeducated to some extent, and, for the most part, allowed to 
express their own ideas in various art media under the observa- 
tion of, and with the help of, occasional criticism of the investi- 
gator. Included in the program in addition were: the use of 
proper media for direct expression, natural conditions of work, 
effective motivation, a clinical report of each meeting, and home 
visitations. 

The investigator was a trained art teacher, with six years of 
art-teaching experience in a junior high school. It is probable 
that he did not bring into the situation a teaching ability of an 

3 It is to be understood that the foregoing list of rankings, ratings, and
judgments is not to be considered as the establishment of the exact art status
of the various subjects. An objective procedure of measurement was made for
this purpose, furnishing a closer estimate of the art status of each subject at
various periods of time.
extreme degree. In fact, the entire program of stimulation was
not abnormal in that, in the estimation of the investigator, the 
methods, materials, and quality of instruction could be duplicated 
in any efficient school. 

The physical conditions were not ideal, as no art-room, with 
its proper lighting, water-supply and adequate working and stor- 
age space was available. During the course of the investigation, 
four different rooms were assigned, all rather small, which made 
necessary the division of the group into two sections, each meet- 
ing at a different time. The equipment used consisted of a 
child’s upright easel, chair, and table space for each subject.
The art materials included an adequate supply of 16 in. x 22 in. 
white drawing (book) paper, colored crayons, black sketching 
sticks, soft, colored chalk, modelling clay, tempera paint in 2-oz. 
bottles, large (% in.) sable brushes, and glasses for holding 
water. The majority of the work was done with the soft chalk 
and tempera paint. 

Each of the two sections of the group met with the investigator 
twice a week for periods of 30 to 45 min. Since these periods 
were scheduled as far as possible to coincide with the time of 
meeting of the school’s regular art-classes, practically all of the 
subjects received only the regular time allotment for art-instruction.
Hence the subjects, for the most part, were exposed to
art influence during two periods per week. 

The class routine followed this plan, in general: the subjects 
came (or were brought) to the experimental room where they 
seated themselves in a row of chairs. When all were present 
each subject took a place at the table (if clay was to be used), 
or stood at his easel to paint or draw. The child always found 
the easel stocked with several sheets of paper, water, paint, and 
a brush on the floor or on the chair or table at the side of his 
easel. The easels were arranged under favorable lighting con- 
ditions and so placed that a subject standing at his easel could 
not easily see what the other subjects were painting. A mod- 
erate amount of quiet talking among the subjects was permitted 
by the investigator, who strove to attain in the group an atmos- 
phere of naturalness. At the end of the period, the child named 
his product or products, washed out his brush, dumped his glass
of water or boxed his sticks of chalk, and was excused from
the room.

The method of instruction used was a combination of the 
so-called ‘ free expression’ method and the directed training of 
observation (representation) type. In this method, as followed 
in this program, the subjects were allowed to paint or draw 
their own ideas from a given general theme or subject, working 
largely from memory and imagination. Criticisms by the 
investigator were given sparingly—usually only upon request.
Help usually took the form of quick drawings made before the 
class (these being covered before the subject started to work),
or in verbal descriptions of possibilities for work. Occasionally
a simple, diagrammatic sketch was made before the group to
show gross relationships and proportion of objects under discussion,
and even a jointed cardboard figure, constructed of
simple oval pieces pinned together at the joints, was resorted 
to in studying the human figure. Such sketches and aids to 
observation always followed the needs and requests of the child; 
they were never introduced primarily as a subject for discussion 
or observation. The child was repeatedly urged to express his 
own ideas in his own way, as it was felt that, in addition to
the possible production of such values as the development of 
the child’s self-confidence and pride in his own creation, this 
type of resultant graphic product would obviously afford a sig- 
nificant indication of the status of the subject’s art ability. Thus 
creative expression was considered by the investigator to be 
more probably indicative of the art ability of the subject. The 
following of this practice, assisted at times by common aids 
to drawing and imagination, expressed both graphically and in 
clay, constituted in the main the technique of instruction. 

One important phase of the program of stimulation was the 
system of motivation used. The motivation can be classified 
under four heads: (1) judicious praise, (2) visits to art 
exhibits, (3) exhibition of subject work, (4) possession of art 
materials at the subject’s home. 

Since during the first year all the subjects were Zs, the praise 
used took the form of casual but definite approval of anything
the subject did which showed sincere effort. Occasionally dis- 
approval was used in the reverse circumstance. Poor drawing, 
color, and other evidences of lack of ability were not brought 
to the subject’s attention. It was attempted to make the subject 
feel that his efforts, even though different from those of others, 
still possessed merit. The subjects were permitted to think they 
had some ability; most of them believed that they had been 
selected for inclusion in this special class for that reason. The 
praise factor, as striven for in this program, was directed 
toward a realization on the part of each subject that his own 
graphic products were meritorious; if necessary, the investigator 
gave actual help in painting to make such realization possible.⁴
This form of help was given only to prevent the occurrence of 
an unfortunate effect, or to correct one already made; it was 
given quickly and quietly. The child was led to believe, if 
possible, that practically all the work was his own. The investigator
was forced to make use of this ‘breach of teaching
ethics’ only rarely, a practice which seemed amply justified in
this experimental situation.

Visits to exhibitions of oil paintings were occasionally made. 
During the two years, four such visits were made. The sub- 
jects were allowed to look at the paintings as they pleased, 
attention to the few remarks of the investigator relative to the 
merits of various elements of the pictures being entirely vol- 
untary with the subject. In discussing these paintings, as in 
the class work, the investigator strove for esthetic appreciation 
rather than an understanding of the degree of realism used, 
knowledge of the ‘story’ depicted, or factual material such as 
names and dates. 

Subjects were taken in small numbers on various Saturday 
mornings to a studio, where they saw large easels, oil paints 
and unfinished pictures about them. In this environment they 
were allowed to draw and paint, using easels and materials 
identical to those employed in the regular class period. 


4 Such products were eliminated from the list of paintings used for purposes 
of establishment of artistic status. 












The exhibition of paintings made by the subjects was 
arranged largely during the second year. This motivating
factor took the form of an occasional showing of one or more 
paintings by a subject to the group or to the pupils and teacher 
of the subject’s school room; this factor also included the paint- 
ing of a special picture by each subject for his parents, framed 
with an appropriate mat and delivered to the subject’s home 
immediately preceding Christmas; it likewise included a formal 
school exhibit of framed paintings by the subjects, hung for a 
week and a half on the walls of two special school rooms. 

After every meeting of the group the investigator recorded
clinical notes of the proceedings.
Responses of the subjects in the various situations of successful 
endeavor, failure, and praise were noted, as well as all situa- 
tions of esthetic significance. Assistance in this phase was 
obtained by the occasional use of a stenographer, who recorded 
verbatim samples of the subjects’ conversation, as well as 
responses to questions and remarks of the investigator. 

A final phase of the program of stimulation was the contact 
kept with the home of each subject over the entire period. At 
the beginning of the program an easel and full set of art sup- 
plies were taken to each subject’s home, where acquaintanceship 
to some degree with the home conditions of the subject was 
made and where relations of co-operation with the parents were 
established. An effort was made to obtain for each child some 
degree of freedom and interest in painting and drawing at home. 
In some cases the parents were urged not to send the child to his 
easel merely to keep him out of mischief, or to prevent him 
from doing something else that he wanted to do at the time. 
After the first visit a second was made, during which a prepared
list of questions was asked about the subject, and observations
were noted. A few additional visits were made to each
home to replenish art supplies and to collect paintings. The par- 
ents were not informed as to what the exact basis of selection 
of their child had been. 

Group B consisted of approximately 100 pupils each from 
grades 1 through 4 from the Des Moines Schools. Since Group
B was used only during the second year of the program, grades 1
through 4 paralleled the grades of the special X and Z subjects. 
Each grade presented approximately 100 children, whose distri- 
bution of intelligence scores approached normality. Intelligence 
was measured by the revised form of the Kuhlmann-Anderson 
group intelligence test, given by Tiebout at the start of the 
Group B program (Nov.-Dec., 1933). Descriptive data
concerning Group B are supplied in the Tiebout article.⁵

The program for Group B consisted, in brief, of three steps: 
(1) the subjects were selected as previously described; (2) all 
subjects were given an esthetic sensitivity test; (3) each subject
made three original paintings from which his art status
was estimated. 

Immediately following the selection of subjects for Group B 
on the basis of IQ, the responses of the subjects to the group 
esthetic sensitivity test were obtained. This test had been pre- 
pared by the investigator for this program; it consisted of a 
series of various art situations in which each subject’s choice 
was recorded. Since each subject’s art status (painting score) 
was later to be determined, the present esthetic test provided 
possibilities for discovering whether there were esthetic sensi- 
bilities differentiating the artistically superior from the artistically 
inferior subjects. Information concerning the nature of these 
differentiating sensibilities was held to be of major importance, 
both as general knowledge in this field, and as specific assistance 
which it would offer in the stimulation of Group A subjects, 
and in interpreting the findings therefrom. The esthetic test 
likewise provided an opportunity to study the growth of esthetic
sensitivity in average children from year to year, furnishing a 
standard of performance against which the comparison of the 
special subjects constituting Group A might be made possible. 

The art ability-status of the Group B subjects was obtained 
from a procedure of objective evaluation (described in the fol- 
lowing section) of three free paintings made by each of the 
subjects. These paintings were made under set conditions. In 
Grade 1-B (B=first semester) two paintings and two crayon
drawings were made by each subject. The crayon drawings
were included because of the lack of experience of these subjects
with the medium of paint.

5 Tiebout, C. E., pp. 87-89. THIS VOLUME.

It is to be noted that the same mechanism (scale) which was 
developed for use in estimating the art status of Group B subjects
provided the means for measuring the art status of the
special X and Z subjects at different periods in the program of 
stimulation, thus determining the amount of gain made by them 
from time to time. 

IV. Measurement of art ability-status. The technique of 
using a reliable scale in judging actual achievement in art 
appeared to be the best method for use in this field. Brooks (4) 
found that a drawing scale reduces the inaccuracy of the rating 
by one-half of what it would be without the scale. Granting 
the availability of a reliable scale, the most satisfactory method 
of determining the art ability-status of the subjects, as repre- 
sented by their graphic products, would be the use of such a 
scale by several competent, experienced judges, using the aver- 
age of their ratings as a satisfying approximation to the real 
status. Since no available scale, including the Thorndike and
the Kline-Carey,⁶ was found to be particularly suited for this
purpose and age range, it was necessary to construct
a scale that would fit the requirements and needs of this
particular situation.

The scales for each of the grades having been established 
with the co-operation of Dr. Tiebout and described in her 
article,⁷ the departmental judges judged the Group B paintings,
this time using the scale to guide their concepts of art value. 
They did not, of course, re-judge any of the samples sent to 
the expert judges. As a result, the art ability-status, or paint- 
ing score, of each subject was established by the average rating 
of four departmental judges with the help of a scale, or by 
the average rating of 14 expert judges working without a scale. 

6 The Kline-Carey was not considered sufficiently refined at the lower age
levels to serve this purpose.

7 Tiebout, C. E., op. cit., p. 89 et seq.

To carry out one of the purposes of this investigation it was
necessary to identify a number of subjects as superior (X) and
inferior (Z) in ability and to distinguish them from the average
(Y) performers. It was thus necessary to place a dividing line
at the limits of the average group. This division was made at
one S.D. above and below the mean in each grade, as shown
in Figs. 1-4. This procedure resulted in the isolation at either
end of approximately 16 per cent of the cases. Although the
procedure was arbitrary, this proportion of Z and X subjects did
not seem, according to subjective experience, to be either
unreasonably high or low for the general population.

The number of actual cases in Group B identified in this
manner was as follows:


Grade     X     Z
  1      18    21
  2      18    14
  3      15    13
  4      13    15
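The division at one S.D. above and below the mean can be illustrated with a short sketch. It is a minimal example and not the investigator's own procedure: the function names, the use of the population standard deviation, and the treatment of a score falling exactly on a boundary as average (Y) are all assumptions.

```python
from statistics import NormalDist, mean, pstdev


def classify(scores):
    """Label each painting score X (superior), Y (average) or Z (inferior).

    Assumption: the cut-offs are the mean plus and minus one population
    standard deviation; a score exactly on a cut-off is treated as Y.
    """
    centre = mean(scores)
    spread = pstdev(scores)
    upper, lower = centre + spread, centre - spread
    labels = []
    for score in scores:
        if score > upper:
            labels.append("X")
        elif score < lower:
            labels.append("Z")
        else:
            labels.append("Y")
    return labels


def expected_tail_share():
    """Share of a normal distribution lying more than one S.D. above the mean."""
    return 1.0 - NormalDist().cdf(1.0)
```

Under a normal distribution `expected_tail_share()` is a little under 0.16, which agrees with the isolation of approximately 16 per cent of the cases at either end.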


The purpose of the identification of these subjects, as previously
outlined, was to study comparatively their sensitivity to various
art situations as presented in the group esthetic test.

V. The analysis of esthetic sensitivities. The so-called group
esthetic test was in reality a series of controlled art situations,
so presented that one variable, e.g., color harmony, was intro- 
duced at a time. The presentation was made graphically on 
a screen before a group of subjects; the materials were of a 
dual-choice nature to which the subject responded by recording 
his choice of the better or more artistic of the pair. At the 
end of the test his recorded choices revealed the subject’s sensi- 
tivity to the series of art situations compared to another subject’s 
sensitivity. These art materials or situations were not used as
a test in the sense of identification of subjects as superior, average
or inferior in either art appreciation or ability; they were
used to study the comparative sensitivities of subjects previously 
so identified. 

The materials used for the group esthetic test were assembled
from actual test material taken from tests and scales, namely,
those by McAdory, Meier-Seashore, Whorley, Christensen,
Lewerenz, and Saunders. Thus at the beginning the chances for
presenting valid art material were much improved. Only those 
test items which seemed to be within the grasp of the age-range 
used in this study were retained, which eliminated the greater 
portion of the items of the above tests. While some of the 
material was already in dual-choice form, a portion, existing in 
multiple-choice form, had to be simplified. In the latter case, 
as in the McAdory situations, where four items to be ranked 
were presented on a card, the items selected for use were those 
of extremes in rank, or near-extremes, thus doubly insuring the 
fact that one item of the pair was better in art value than the 
other. This treatment obviously simplified the judgment required 
by the subject. It was felt that this was advisable—even 
necessary—when very young subjects were used. 

The test materials were prepared for use in an opaque pro- 
jector (1,000-watt) throwing the image directly to the back of 
a translucent screen. Since the items included drawings, half- 
tones, and photographs in black-and-white and in color, this type 
of equipment was found satisfactory for presenting the test 
material in increased size to an audience of subjects.

The test was shown a number of times to groups of critics
and to classes of school children for the purpose of refinement 
on the bases of validity, appropriateness to age-range tested, and 
fidelity of image. As a result, 56 items made up the test as 
used. The first item was a practice exercise and was not scored. 

In the response sheets, which were mimeographed, no writ- 
ing was necessary beyond the writing of the subject’s own name. 
Even this was not done by the subjects themselves in the 1-B 
Grade. Response to each art situation was made by drawing a 
circle around an initial on the sheet standing for the picture 
which, in the judgment of the subject, was better or more 
artistic. In the first grade every response of every subject was 
checked by an assistant before the next situation was presented. 

Scoring of responses was done according to the number right. 
The maximum score was 55. Since pure guessing, or
marking all responses A or all B, would probably yield a score of
27 or 28, the base line is
a point half way between 0 and 55, or 27.5. The usual method
of scoring true-false tests, by subtracting the wrong from the 
right responses was not followed in this instance, as this situation 
was not so much one of wrongness or rightness, as of esthetic 
preference,—the score wanted being the number of times the 
subject chose the more artistic of a pair presented. 
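The scoring rule just described can be sketched as follows. This is an illustrative example only: the function names are hypothetical, and the answer key in the usage note is invented, since the actual key is not given in the text.

```python
def number_right(choices, key):
    """Score = number of items on which the subject chose the keyed picture."""
    if len(choices) != len(key):
        raise ValueError("choices and key must cover the same items")
    return sum(1 for choice, keyed in zip(choices, key) if choice == keyed)


def chance_base_line(n_items):
    """Score expected from pure guessing on dual-choice items."""
    return n_items / 2


def rights_minus_wrongs(choices, key):
    """The usual true-false correction, which was NOT used in this study."""
    right = number_right(choices, key)
    return right - (len(key) - right)
```

With 55 scored items `chance_base_line(55)` is 27.5. For an invented key of 28 A's and 27 B's, a subject marking every response A would score 28 by `number_right` but only 1 by `rights_minus_wrongs`.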


TABLE III. Averages of esthetic test scores by grade, Group B


Grade     Av.       σ
  1      29.00     8.63
  2      33.45    10.92
  3      36.00    12.34
  4      39.97     8.59


VI. Results. The results of this investigation fall under two 
heads: (1) evidence of the possibility of art ability to change,— 
this being shown by the amount of change produced in the 
Group A subjects; (2) the identification of certain esthetic 
sensitivities which differentiate the typical performance of X 
and Z subjects, resulting in the description of a number of 
component factors of art ability in children. 


(1) The possibility of change in art ability. The amount of
actual change in art ability of the Group A subjects resulting
at the end of the two-year period of stimulation was obtained 
by measuring the difference between the original status of each 
subject and his final status. 

The following method was used in the determination of art 
status: the painting scale developed from the 1921 paintings 
of the Group B (normal) subjects was used as the rating device. 
Two groups of judges were used: the first was composed of 
four qualified art-psychologists and artists; the second group, 
used as a check on the first, consisted of five artists from outside
the staff who were not especially familiar with children’s
work. The experimenter did not act as a judge. Each of the 
subjects was represented by a number of free paintings made 
without assistance of any kind and produced under controlled 
conditions similar to those used for Group B. The chief dif- 
ference was that each child of Group A was represented by a 
larger number of paintings, the average number at the beginning
period being 5.0, at the mid-period 4.6, and at the final
period 6.6.


Measurements of art status were made at three periods of 
time: (1) at the beginning of the period of stimulation, Nov., 
1932; (2) at the mid-period, a combination of the last of the 
first year and the beginning of the final year; (3) the final 
period, or end of the period of stimulation, viz., April-May, 
1934. All the finished products made under the proper 
(described) conditions and during the above periods of time 
were included in the list. There was no picking and choosing 
of entries. 

The first group of judges (art-psychologists and artists 
acquainted with children’s work) rated the Group A paintings 
on the scale,—the paintings made by the subject while in a 
certain grade being rated on the appropriate scale. Nine paintings
of the list, made in the junior primary (kindergarten),
were rated by necessity on the Grade 1 scale. A correction was 
allowed for this and will be discussed later. 

Table V [p. 144] shows the average painting scores of each
subject’s pictures for the three periods of time. The painting
score represents the average of the four judges’ ratings,
averaged over the pictures listed for that period.

These painting scores are a somewhat reliable approximation
of the art ability of the 14 subjects. The inter-agreement of
these judges (first two correlated with last two) was found to
be .84±.008. When a second group of judges was used to check the
validity of the first group, an agreement of .81±.009 was found
to exist, even though the second group were all artists interested
in different fields, not familiar with children’s work, and
not practiced in or concerned with scientific measurement. Their
inter-agreement (average of three correlated with average of
remaining two) was found to be .69±.013.
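The agreement figures above can be illustrated with a brief sketch. It rests on assumptions throughout: the text does not state how the judges' ratings were combined or how the ± figure was obtained. Here the two halves are taken as the mean rating of the first two and of the last two judges, and the ± figure is assumed to be the classical probable error of a correlation, .6745(1 - r²)/√N. The function names are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def split_half_agreement(ratings):
    """Correlate the mean of judges 1-2 with the mean of judges 3-4.

    `ratings` holds one (j1, j2, j3, j4) tuple per painting (assumed layout).
    """
    first = [(a + b) / 2 for a, b, _, _ in ratings]
    last = [(c + d) / 2 for _, _, c, d in ratings]
    return pearson_r(first, last)


def probable_error(r, n):
    """Classical probable error of r for n cases (assumed formula)."""
    return 0.6745 * (1 - r * r) / sqrt(n)
```

Because the number of paintings entering each correlation is not reported here, the sketch cannot be used to check the published ± values; it only shows the form such a computation would take.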

To find the amount of net change in each subject’s status 
from first to final period required the establishment of the 
normal increment or decrement from grade to grade. Accord- 
ingly, the pictures constituting the scales for Grades 1, 2, and 
3 were rated on the Grade 4 scale by a final group of judges. 
Six judges made up this group; all were experienced artists 
and art teachers. Four of them were new to this situation,— 
the remaining two had used the scales a month previously for 
a short period. Each of the judges was uniformly instructed
by a prepared form before rating the paintings. The average
of the six ratings of each painting was taken as the score
for that painting; the average for each entire grade was then 
computed with the following results: 


Grade 4 3.00 (assumed) 
Grade 3 2.55 (derived) 
Grade 2 2.58 (derived) 
Grade 1 1.82 (derived) 


Thus the normal performance for Grade 3 by this method is
.45 in painting score value below the normal performance of
Grade 4.⁸ Grade 2 is then practically equal to (.03 better than)
Grade 3 in value, and Grade 1 is .76 below Grade 2.


8 Some doubt exists on the part of members of the staff as to whether the
actual difference in value between Grades 4 and 3 is as great as here indicated.
Inexperienced judges seem to have a tendency to rate pictures low on a scale,
as if there were a ‘halo effect’ surrounding pictures hung on a wall. The
average ratings of even a large group of paintings seldom fall at the mid-point
(3.00) of the scale.

Since all paintings were scored on the grade scale appropriate
to the grade of the subject making the painting, the difference
in value between the grades would be taken care of in the scoring
process. However, because the first period paintings of the
subjects of Group A were made four months of the year earlier
than the final paintings, four months of the yearly increment
was deducted to cancel this factor of growth. Thus the net
change in each X subject’s painting score over the two-year
period was found by deducting from the gross change one-third of
the appropriate increment for that year. The resulting figures
indicating amounts of net change in painting scores together with
the related data are shown in Table IV.


TABLE IV. Change in painting score of Group A subjects


                  First     Mid-      Final     Gross     Net
Subj.   Grade     Period    Period    Period    Change    Change

  1      3-4      2.37      2.09       -        -.28      -.43
  2      3-4      2.02      2.12      2.85       .83       .68
  3      3-4      2.03      2.96      3.51      1.48      1.33
  4      2-3      3.31      2.59      3.49       .18       .19
  5      2-3      1.48      2.42      3.12      1.64      1.65
  6      2-3      3.70      2.45      3.83       .13       .14
  7      2-3      1.42      2.14      2.50      1.08      1.09
  8      2-3      2.94      3.04      3.69       .75       .76
  9      2-3      1.48      2.48      2.53      1.05      1.06
 10      1-2      3.88      2.91      3.55      -.33      -.58
 11      1-2      2.39      2.40      3.37       .98       .73
 12      1-2      2.14      2.78       -         .64       .39
 13      1-2      1.60      2.50      3.16      1.56      1.31
 14      JP-1     1.82*     2.84      3.50      1.68      1.55


* In adjusting the scores of junior primary paintings, an increment of .39 was allowed,
this being the average increment between Grades 1 and 4.
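The correction for normal growth can be checked against the table with a small sketch. It assumes that the four months deducted amount to one-third of the yearly increment, and that the yearly increment is the difference between the derived grade averages reported earlier; the function names are hypothetical, and the junior primary case (Subject 14) is not handled.

```python
# Average painting score of each grade's scale pictures when rated on the
# Grade 4 scale (values reported in the text).
GRADE_AVERAGES = {1: 1.82, 2: 2.58, 3: 2.55, 4: 3.00}


def yearly_increment(lower_grade):
    """Normal change in painting score from lower_grade to the next grade."""
    return GRADE_AVERAGES[lower_grade + 1] - GRADE_AVERAGES[lower_grade]


def net_change(gross_change, lower_grade):
    """Gross change less one-third (four months) of the yearly increment."""
    return round(gross_change - yearly_increment(lower_grade) / 3, 2)
```

For Subject 2 (Grades 3-4) `net_change(0.83, 3)` gives 0.68, and for Subject 13 (Grades 1-2) `net_change(1.56, 1)` gives 1.31, both as tabled. For the Grade 2-3 subjects the increment is slightly negative (-.03), so the net change exceeds the gross change by .01.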


Subject 1, originally classified as a Z, is shown by the score 
of his first period paintings as being above the upper Z level, 
which is 2.03 for Grade 3. This subject is shown to have made 
almost no progress during the one year during which he was 
included with the group, as the gross change shown is between 
zero (normal progress) and —.45, or a total lack of progress 
(subtracting the increment from the normal rate). Subject 10, 
the highest ranking X at the first period, is shown to have made 
practically no progress during the two years. This subject was
present as a member of the stimulated group during only part
of one year.

When the 12 subjects for which there are complete records
are considered in order of original standing, it is seen that in 
general those of higher status made less progress than those 
ranking lower. 

When the normal limits of X and Z status, as defined by 
Group B, are considered, the original art status of the Group A 
subjects, as approximated at the beginning of the study, is corrected
in two instances, namely 8 and 11. These subjects,
originally classed as X and Z respectively, are both listed in
Table V as Ys.


TABLE V. Net change and final status of subjects ranked in order of
original status


         Corrected     First                  Final
         Original      Period     Net         Art
Subj.    Status        Score      Change      Status
 10      X             3.88       -.58        near-X (3.30)
  6      X             3.70        .14        X (3.84)
  4      near-X        3.31        .19        X (3.50)
  8      Y             2.94        .76        X (3.70)
 11      Y             2.39        .73        Y (3.12)
  3      Z             2.03       1.33        X (3.36)
  2      Z             2.02        .68        Y (2.70)
 14      Z             1.82       1.55        X (3.37)
 13      Z             1.60       1.31        Y (2.91)
  9      Z             1.48       1.06        Y (2.54)
  5      Z             1.48       1.65        Y (3.13)
  7      Z             1.42       1.09        Y (2.51)


The final art status of the subjects, also shown in Table V,
was obtained by combining with the original position (first
period score) the net change made by each. By this method
subjects 3 and 14, originally Zs, have changed to X status. All
other Zs have been raised to Ys.⁹ This is shown graphically
in Fig. 5.

FIG. 5. Amounts of change in status of Group A subjects during two
years of stimulation

(2) The identification of esthetic sensitivities: (1) Through
Group A. Table VI shows a summary of clinical observations¹⁰
accompanying the two-year program of stimulation. Since this
program, described previously, was somewhat constant for all
subjects, only the individual difficulties and special treatment
are shown [p. 146].


9 No claim is made that the art ability-status of any individual can be
adequately listed at a finely drawn figure. In general, however, the conditions
represented in Fig. 10 approximate the true conditions, a statement borne out
by a separate study of the subjects and of their products. These products are
on file at the Psychology of Art division of the Department of Psychology.

10 The complete record of observations is on file at the Psychology of Art
division of the Department of Psychology.

Certain psychological conditions were noted as peculiar to each 
Z subject individually. Lack of interest at the start on the part 
of the Zs was general and at the final period the interest level 
was higher in almost every case. A corollary to this is found 
in the high level of interest of all Xs at the start. These subjects 
were not marked with the difficulties that beset the Zs. 

Subjects 5, 7, and 9 were boys from typical rural homes,— 
the only rural homes represented. In each of these cases the 
homes might be classed as normal, but none of the three presented
a favorable esthetic environment for the child. In each
situation the subject’s time at home was occupied to some extent
with chores. In each case the parents were either unwilling or
unable to realize the value of esthetic influences in the home.
One of these three subjects, No. 7, was studied by Rodgers, 
who found the home to rank the lowest esthetically of any of 
the Iowa City homes she studied. It is to be noted that the
special treatment of these subjects was not completely effective.
The only other subject in this category (No. 13) was also a 
boy, but of an urban and attractive home. 

As a general observation of the original difficulties of the 
average Z subject, it can be said that the most frequent trouble 
was the apparent inability of the subject to express himself 
easily in pictures. Lack of observation and lack of an efficient 
visual memory and of imagination seemed to be causal factors. 
Some subjects seemed simply to lack practice in drawing; others 
who had previously had abnormal amounts of practice still were 
unable to draw satisfactorily. The realization of these condi- 
tions by the subjects was emphasized by the common practice, 
in the beginning period, of choosing to draw subjects that had 
been drawn before with some degree of success,—usually sub- 
jects they had learned to draw in the regular classroom. In 
the majority of cases the inability to effect successful graphic 
expression, found in the beginning period, resulted in an atti- 
tude of play, or of disinterest and carelessness, or of helplessness. 
These abnormalities of attitude tended to disappear as the sub- 
jects experienced success in expression, the commendation of 
others, and as they gained confidence. 

The kind of art medium used was observed to be a definite 
factor in the possibilities of success in drawing and painting 
of the Z subjects. Graphic media with hard or sharp drawing 
points or edges were found to give discouraging results; soft, 
colored chalk, clay, and paint in jars and large brushes were 
noted to be more encouraging to the subjects. They seemed 
to require less skill in handling, and produced an attractive, 
vigorous effect. 

The types of motivation used were observed to be effective to 
some degree in the case of each subject, varying in direct rela- 
tion to the developed interest. Of the types used, praise of the 
subject’s best products and the related factor of exhibition of 
the subject’s work, seemed to be the most effective. Visits to 
exhibitions of oil paintings increased in subject interest from 
a mediocre level to a higher one. Visits to the studio were 
effective in some, but not in all cases. 


(2) Through Group B. The results of reactions of Group B
to the art situations presented in the esthetic sensitivity materials
provide an analysis of some factors of esthetic sensitivity.
The results are presented in Tables VII and VIII.


TABLE VII. Comparative performance of X and Z subjects (Group B)
on esthetic test items grouped according to art situations


                       Grade 1         Grade 2         Grade 3         Grade 4
                       X      Z        X      Z        X      Z        X      Z
Balance (7 items)
  A.M.               3.78   3.48     4.00   4.64     5.20   4.92     5.85   4.00
  S.D.               1.08   1.76     [?]    [?]      1.56   1.09     .837   1.16
  S.E.m.             [?]    [?]      [?]    [?]      .403   .302     [?]    [?]
Rhythm (11 items)
  A.M.               [?]    [?]      6.22   6.50     [?]    7.45     [?]    [?]
  S.D.               1.30   1.47     1.66   1.55     [?]    [?]      1.54   1.90
  S.E.m.             [?]    [?]      .391   .414     .473   .588     .427   .491
Proportion (18 items)
  A.M.              10.17   9.81     [?]    [?]     13.93  10.85    14.23  12.73
  S.D.               [?]    [?]      [?]    [?]      [?]    [?]      .903   1.87
  S.E.m.             .280   .519     .636   .641     .682   .691     .250   .483
Color Harmony (6 items)
  A.M.               2.44   3.00     [?]    [?]      4.40   2.92     4.38   3.67
  S.D.               1.54   1.60     1.67   1.66     1.08   1.15     .945   .773
  S.E.m.             [?]    [?]      .394   .444     .279   .319     .262   .200
Arrangement (13 items)
  A.M.               6.89   6.57     7.56   6.36     [?]    [?]     10.46   8.40
  S.D.               2.40   1.66     1.93   1.94     1.51   2.19     1.91   1.36
  S.E.m.             [?]    [?]      .455   .518     .390   .607     [?]    [?]
Subject N             18     21       18     14       15     13       13     15


The 55 choices comprising the esthetic sensitivity materials
presented five classes of art situation,¹¹ namely balance, rhythm,
proportion, color harmony, and arrangement, divided in number
as follows:


balance 7 
rhythm 11 
proportion 18 
color harmony 6 
arrangement 13 


11 Since it is practically impossible to divorce one art element entirely from
all others, each of the items of the test presented the variable of one element
largely to the exclusion of the others, and is listed under the name of the
element which it thus represents to greater degree.


TABLE VIII. Significance of the differences of the mean performances of X
and Z subjects on esthetic test items (from data in Table VII)


                                                  Degree of Significance
                                                  (Chances in 100 that
                    Diff.    σ Diff.    D/σ       X is better than Z)
Balance
  Grade 1            .30       -         -        not signif.
        2           -.64     .478      -1.38      92*
        3            .28       -         -        not signif.
        4           1.85     .379       4.88      significant
Rhythm
  Grade 1            .40     .443       .903      82
        2           -.28       -         -        not signif.
        3           1.12     .755       1.48      93
        4            .32     .651       [?]       79
Proportion
  Grade 1            .36       -         -        not signif.
        2           2.24     .903       2.48      99
        3           3.08     .971       3.8       significant
        4           1.50     .544       2.78      significant
Color Harmony
  Grade 1           -.56     .504      -1.11      87*
        2           -.12       -         -        not signif.
        3           1.48     .424       3.49      significant
        4            .71     .330       2.15      98
Arrangement
  Grade 1            .32       -         -        not signif.
        2           1.20     .689       1.74      96
        3           1.96     .721       2.72      significant
        4           2.06     .636       3.24      significant


* Reversed condition; chances in 100 that the Z group is better than the X.


The performance of the 64 X and the 63 Z subjects of Group
B is shown in Table VII in the mean, the standard deviation, and
the standard error of the mean of the scores on the grouped
items making up the five types of art situation. The difference
in performance of the X and Z subjects as classes is shown in
Table VIII, where the significance of the differences between
the X and the Z performance is shown. The standard error
of the difference between each pair of means is arrived at by
the use of the formula:

\[ \sigma_{\text{diff.}} = \sqrt{\sigma_{(M_x)}^{2} + \sigma_{(M_z)}^{2}} \]

where M_x is the mean of the X scores and M_z is the mean of
the Zs. This is a standard formula for uncorrelated measures.

The significance of the difference between an X and a Z average
is shown by the usual formula:

\[ \text{signif. of diff.} = \frac{M_x - M_z}{\sigma_{\text{diff.}}} \]

This results in a measure of significance expressed by chances
in 100 that there is a true difference between the averages.
Significance is here accepted as 2.5 σ, which is 99.38 chances in 100.
All differences greater than this are marked ‘significant’ in
Table VIII.
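The two formulas can be traced through a worked sketch using the Grade 2 proportion figures from Tables VII and VIII. The function names are hypothetical. Converting the ratio to chances in 100 through the cumulative normal curve is an assumption about how the tabled chances were read off, as is the relation S.E.m. = S.D./√N, which the text does not state but which reproduces the legible entries of Table VII.

```python
from math import sqrt
from statistics import NormalDist


def se_mean(sd, n):
    """Standard error of a mean (assumed to be S.D. divided by root N)."""
    return sd / sqrt(n)


def se_difference(se_x, se_z):
    """Standard error of the difference between two uncorrelated means."""
    return sqrt(se_x ** 2 + se_z ** 2)


def critical_ratio(diff, se_diff):
    """Difference between the means divided by its standard error."""
    return diff / se_diff


def chances_in_100(ratio):
    """Chances in 100 of a true difference, from the cumulative normal curve."""
    return 100 * NormalDist().cdf(ratio)


# Grade 2, proportion: S.E.m. of .636 (X) and .641 (Z), difference 2.24.
se_diff = se_difference(0.636, 0.641)
ratio = critical_ratio(2.24, se_diff)
```

Here `se_diff` comes to about .903 and `ratio` to about 2.48, matching the Table VIII entries, and `chances_in_100(2.5)` returns about 99.38, the level adopted as the criterion of significance.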

The fact that a difference is not shown to be significant does 
not necessarily mean that a true difference between the subjects 
in this respect does not exist; it merely leaves the problem 
unsolved as far as the present measure is concerned. 

Table VIII shows the tendency of a lack of significance 
between X and Z performances in Grade 1 in all the types of 
art situation (except rhythm)¹² and a growing presence of
significance as the older grade levels are reached. 

VII. Summary and conclusions. This investigation was under- 
taken to examine the fixity or stability of art ability—to test 
the possibility of change of art ability, and to identify the factors 
and conditions related to any such change. The approach was
made through a group (Group A) of 10 inartistic and four
artistic children, the majority of whom were stimulated during
two school years by a special program of instruction including
free expression and motivation. Measurements of the art ability
of this group were made at the beginning, mid-point, and end of
the program by the rating, on a prepared scale and by a group of
qualified judges, of a number of each subject’s paintings made
under controlled conditions.

The factors of esthetic sensitivity differentiating children 
artistically superior and artistically inferior were identified to a 
certain degree by the comparison of responses of a large number 
(Group B) of normal children, previously measured in artistic 
ability, to esthetic situations of five types. The scale, through 
the use of which the paintings were rated by groups of judges, 


12 No explanation is apparent for the approximate constancy of near-significance 
in all grades, except Grade 2, in rhythm. 

resulting in estimates of pupil art ability status, was itself a 
product of ratings and rankings of a larger group of highly com- 
petent and widely recognized artist-art educators, distributed geo- 
graphically, who judged samples of paintings from each value 
level of the four grades (409 subjects) constituting Group B. 

The results of the two years of special stimulation show that: 

1. Of the nine subjects classified as artistically disinterested 
(Z) or low average (Y) subjects, all exhibited definite and in 
six cases striking improvement. 

2. Of eight presumed Z-status subjects one was found by 
refined methods to have low Y-status, and two, borderline 
Z-status. There were, hence, five definitely Z-status subjects. 

3. Of eight subjects having Z or borderline status, all being 
considerably below the median of their age and grade, four at 
the end of two years approached, approximated, or attained bor- 
derline X-status or definite X-status. 

4. Of the two Y-status subjects, one remained a Y but showed 
a change from a low Y to a high Y; the other rose from slightly 
above average to a high X-status. 

5. (a) In general it is observed that subjects of Z-status 
showed a change leading to Y-status (five out of seven cases) 
and X-status (two out of seven cases). 

(b) In all cases it was demonstrated that improvement was 
possible. This improvement extended in most cases from Z to 
Y or from Y to X, and in two instances from Z to X. 

6. It is therefore believed that a radical change in level of art 
ability is possible under especially favorable conditions; in the 
majority of cases, however, the utmost to be confidently expected 
normally would be a change from Z to Y, or from Y to X. 

7. The amount of gain made was, to some degree, in direct 
relation to the amount (time) of stimulation received, and to the 
degree of inferiority at the start. These conditions support the 
first conclusion in that they show a relationship between change 
in art ability and stimulation. 

8. Qualitatively, various conditions appear to be causal in the 
instance of inferiority, which, being corrected or changed, allow 
the individual to become normal or even superior. Typical 
conditions defined by this investigation are: inartistic and 
unfavorable home conditions (this is supported by the Rodgers 
study, 49); inexperience in drawing or experience of failure in 
drawing; the use of improper materials; lack of motivation; lack 
of sensitivity to elements of artistic quality (balance, proportion, 
color harmony, and arrangement). 

9. The lack of esthetic sensitivity, as shown by the methods 
and materials of this study, is practically the normal condition 
at the Grade 1 level, sensitivity becoming more evident as a dif- 
ferentiating agent in the subsequent grades, resulting in Grade 4 
in a significant or nearly significant difference between inferior 
and superior subjects in all of the art elements considered. 
Facility in artistic expression at these ages appears to be favored 
by this factor when found in connection with favoring physical 
conditions, esthetic surroundings, reasonable motivation, and 
confidence. 

ART ABILITY WITHOUT INSTRUCTION OR 
ENVIRONMENTAL BACKGROUND: CASE 
STUDY OF LORAN LOCKHART¹ 


by 


NORMAN C. MEIER 


I. Introduction. In January, 1934, through the publication of 
a feature story in an Iowa newspaper, our attention was called 
to the case of a child, Loran Lockhart, seven years of age and 
blind from birth, who, having had sight afforded by operations, 
then began to produce drawings unusual for his age, 
one of which was reproduced in color in the article. Because of 
the obvious bearings of such a unique case on the theoretical 
implications of the question of the heredity of drawing ability, 
the case was investigated. The facts as ascertained were as 
follows: the defective vision, which amounted approximately to 
total blindness, though the child could dimly discern daylight 
from darkness and the general presence of a lighted lamp before 
him, was caused by a congenitally imperfect visual apparatus 
including nystagmus and complete double cataract. The latter 
was subjected to a series of operations, mostly of the needling 
type, with the expectation that the fluids would clear up the 
cataract substances. Not finding any reaction of this type 
developing, the surgeon resorted finally to the expedient of 
removing both natural lenses, thereby giving the subject a fair 
degree of vision through the aid of two sets of special cataract 

1 This unusual case was studied at Loran’s home, his school and other places 
in the vicinity; and at the hospital and surgeon’s home in Springfield, Missouri, 
by the author and his Research Associates, Carolyn Tiebout and A. W. 
Saunders. The conclusions and theoretical considerations are those of the 
author. Owing to the relative inaccessibility of the subject, the study necessarily 
was limited in scope. During the year and a half since contact was made 
with the subject the study has continued by correspondence and examination of 
the subject’s artistic productions. 




glasses, one for ordinary vision and one for reading. The 
nystagmus still persists without observable improvement. 

When studied, the child had been going to school in a one-room 
country school for about four months, half of this time 
while blind, with the aid of an older sister. He was still unable 
to read ordinary type and was making progress in certain subjects 
with difficulty by means of large-type books loaned by the 
Missouri State Commission for the Blind. He was nevertheless 
able to do third grade arithmetic and was doing work in other 
subjects comparable to his age except in spelling. Out of doors 
his other set of glasses afforded him fair but not wholly adequate 
vision; yet it permitted him a certain range of activity. He was 
able to run, play most outdoor games, and walk with ease. Many 
of his former habits of touch in aiding prehension and manipulation 
of objects still persisted, as well as a characteristic ‘feeling’ 
step while walking. About his home, which was a one-room 
log cabin in which the two parents and five children 
lived,² he was able to exercise a few surprising activities which 
included riding a billy goat. 

It was therefore necessary to verify certain presumed facts at 
once. First, the fact of presumed competence in drawing; second, 
the possible explanation for this ability either in (a) opportunities 
for realizing the benefits of instruction, or (b) conscious or 
chance aid from anyone coming in contact with the subject who 
might have given him suggestions for his efforts; third, the 
influence of the environment visually experienced during the 
brief time elapsing since he saw for the first time.³ 

In order to test the first assumption, that of whether or not 
this was a case of genuine ability, an easel, similar to the kind 
used in various studies with children of his age, was brought 
along and set up in the home. He was supplied further with 
colored crayons, colored chalk and tempera paint and brushes, 


2 The school is located about 40 miles north of Springfield, Missouri, in the 
Ozark country, 14 miles from any town of consequence and about the same 
distance from any state highway or railroad. The home is half a mile from 
the school in the midst of an oak woods requiring a trip of three miles by 
automobile. 

3 This period has been estimated at not more than three months prior to the 
time of the study, which was made February 5 and 6, 1934. 

as well as a supply of white book paper, 16” x 22”. A number 
of paintings were made in the presence of the investigators, and 
Leica photographs were made at intervals of the boy at work, providing 
a progressive record of his drawing procedure and unusual 
manner of work, because he is left-handed and works close to 
the board. Since that time about 100 additional paintings have 
been received in Iowa City and studied. 

In addition, the mental processes of the child utilized in constructing 
the picture reproduced in the Iowa newspaper feature 
article were investigated and the sources of the content of that 
picture sought. The picture was traced to a farm house and yard 
of an uncle living about two and a half miles away, at which 
place photographs were also taken for purposes of comparison 
with the picture produced by the subject. The picture represents 
a composition utilizing objects seen but manipulated for the pur- 
pose of producing an interesting organization. It was, therefore, 
not a realistic copy of the farm-yard but an organized arrange- 
ment that, at least from most standards, would qualify as good 
composition considered either from the child or the adult viewpoint. 
An analysis of the 100 paintings since received warrants 
their classification as the equivalent in every way of those of the 
average normal child having the benefit of seven years of visual 
experience, except that in content they are limited to the restricted 
visual experiences of the subject. In freshness of color, in ade- 
quacy of impression and statement, they are in many instances 
unusual. In some instances, his forms are superior; in other 
cases they are bold attempts, but show the lack of technical proficiency 
necessary to accomplish something which would be difficult 
for any child. It is, therefore, a matter of fact that the art 
ability shown by the subject is at least equal and possibly some- 
what superior to that of normal children of his age. 

II. Environmental aspect. Environmental influence, considered 
in this connection, may be of two principal kinds: subjective, 
or the influence of physical environment,—chiefly nature, 
buildings, or objects of any kind. Books, pictures, newspapers, 
or magazines may be classified under either caption. Since no 
art instruction was offered in the country school, and the teacher 
was not qualified in that field, nor was there any student among 
the 21 attending the school who could draw as well as Loran, the 
conclusion was drawn that aid had not come from that source. 

The hospital attendants in Springfield were emphatic in their 
statements that no aid could have been received from them, nor 
did they know of any patient there, during the brief time follow- 
ing the removal of bandages, who could have given instruction. 
During almost the entire time the subject had been in the hospital, and 
during his visits, he was without the benefit of vision. 

Inquiry and examination of the books and other printed matter 
in the home threw no light on any possible influence of that kind. 
No books were seen there other than the Bible and one other 
volume; no illustrated periodicals, a few newspapers, and little 
else. The belief that no influence of this kind was operating 
was further supported by the observation that practically the 
entire content of the subject’s paintings concerns the immediate 
surroundings about the home and objects in the subject’s own 
immediate experience such as the cabin, trucks, his goat, cat, and 
chickens. The buildings about the home are of a plain type, and 
the farm is meagerly stocked with several cows, several horses, 
and two goats. Although the subject had been to Springfield 
for medical attention, it was only during the final trip that he 
was able to see his surroundings, plus one other trip on Christmas 
day, six weeks prior to the study, when he saw his first movie 
and, of course, on this occasion and on his return home, motor 
cars, buses and other objects. 

It is, therefore, extremely unlikely that the environment 
visually experienced could have exerted any considerable influ- 
ence such as might account for the production of his artistic 
products. 

III. Results of tests. The other possibility, which might 
explain the accomplishment in part, was assumed to be that of 
superior sensory capacities and general intelligence operating 
through some native ability in manipulation. Accordingly intel- 
ligence tests, one for the blind and one for normal subjects, were 
administered. In both of these tests he showed superior intelligence. 
In the Hayes-Binet test for the blind the score indicated 
an IQ of 132; in the regular Stanford-Binet test the IQ was 116. 
Since the subject was no longer blind, having had the benefit of 
vision for about three months, the Hayes-Binet figure is probably 
too high, and insofar as he lacked the normal experience of a 
visually normal individual save for a brief period, the Stanford-Binet 
result is too low.⁴ On the Goodenough Drawing-of-a-Man 
test, however, the result showed an equivalent of an IQ of 102. 
This apparent discrepancy can be reconciled in the light of other 
test results. 

The Heilbronner apperception test disclosed that the subject 
was unable to recognize familiar forms presented in progressive 
stages of completion. ‘Pencil’ came to him at the third presentation; 
this was the only one successfully recognized (see 
Psychol. Monog., 45, 1933, No. 1, pp. 117-119).⁵ Sample 
responses are as follows: incomplete broom symbol was ‘bell’; 
saw, by progressive stages, was ‘cow,’ ‘cow,’ ‘cow,’ ‘cow’; 
wheelbarrow was passed up to the fourth progressive presentation, 
at which it became ‘sled,’ then ‘wagon’; candlestick was 
responded to as ‘pitcher’ on the first presentation, followed by 
silence up to the fifth, when it was again assigned ‘pitcher.’ 

On the Knox Imagination test his responses were normal but 
not distinctive, except for one response to Blot No. 3. 

The Form Discrimination tests (adapted from Kirkwood) 
disclosed a 54 per cent correct response. This performance is 
little better than a Z classification, and may be a result of failure 
to see well enough to make the necessary discrimination. What- 
ever deficiency in normal visual acuity was operating did not 
prevent him from making an average score on the Lewerenz 
Color Recognition test. In this, however, the color patches are 
relatively large in area. The Ishihara Color Blindness test 
showed complete normality. 

The group Esthetic Sensitivity test⁶ was given in part but 


4 In the opinion of Dr. Tiebout, who administered these tests, his true intelligence 
would be represented by a mid-value, such as 125, which would assign 
him definitely to the category of superior intelligence. 

5 A description of these and the tests immediately following may be found in 
this reference. 

6 Assembled by Saunders and described by him elsewhere in this volume. 

discontinued when it was found that the material was not seen 
sufficiently well to enable him to make distinctions. 

The Kohs Block test was successfully administered and dis- 
closed a strong eight-year performance (score 2/7). The 
particular significance of this test, which requires mental manipu- 
lation of colored facets of blocks, is not definitely known, but 
it is highly probable that persons competent in art ability would 
show advanced performance. 

On the picture analysis test⁷ he was almost completely at a loss, 
primarily for the reason explaining his showing on the apperception 
test, namely, a fragmentary perceptual background. 
Bridges, boats, buildings, unusual color treatments—these were 
all strange and incomprehensible to him, if his reactions were 
judged correctly. Apparently he had no basis in experience for 
recognizing scenes and objects foreign to his Ozark environment 
of three months’ duration. Even objects of nearly every 
4-year-old child’s normal experience were still outside the limits 
of his perceptual range. 

While greatly deficient in perceptual background, the subject 
showed highly superior auditory capacity or rather highly 
developed habits of attention for auditory stimuli and conse- 
quently high proficiency in activities utilizing hearing. His 
advanced standing in arithmetic was attributed to this advantage. 
Having heard older children⁸ going through arithmetic drills 
at home he had given considerable attention; in that manner had 
‘picked up’ the basic combinations perhaps even before entering 
the first grade. Evidences of these superior habits of auditory 
attention were observable on many occasions. At one time, while 
painting with four adults present, he was able to interject 
remarks into a conversation going on at the opposite side of the 
cabin, at the next moment shifting quickly to say something 
to those nearby. 

The exact manner in which vision functions in Loran’s case 
was not ascertained. A study of his pictures, however, reveals 
that unquestionably he has acquired some semblance of space 


7 Using small color reproductions of works of Van Gogh, Gauguin, Monet 
and others. 


8 Loran is the youngest of five children. 

perception, although most of his pictures have a flat character 
which is not uncommon for children of his age who have normal 
vision. This condition is one attributable more to lack of rep- 
resentative skill than to deficiencies in spatial perception. Since 
we were not present when his first adequate visual sensations 
were experienced, it can only be conjectured that his non-spatial 
outlook was gradually constructed at first through conjunction 
with kinesthetic and tactual experiences which soon thereafter 
were greatly accelerated through his restless activity and superior 
intelligence. His ability to catch a ball and to represent successfully 
spatial relations in his drawings indicates that nine weeks 
following the removal of bandages he had a reasonably well 
organized facility in activities requiring a fair spatial perception. 

IV. Hereditary antecedents. It is, therefore, to be observed 
that in this unusual case there is art ability without instruction 
or any discoverable environmental influence adequate to account 
for the production of such drawings and paintings as have been 
executed. Two explanations of this apparent artistic ability 
appearing without the usual bases suggest themselves. One is 
that his view through his newly discovered sense of sight pro- 
vides him with fresh and extremely vivid impressions which, 
through his superior intelligence, he is able to express on paper 
in a normal or even superior manner. This probably accounts 
for the fact that his compositions include in some instances farm 
scenes viewed days or weeks previous to the drawing. The 
attendants at the Springfield hospital characterized his first 
impressions as being especially and keenly vivid in the color of 
various objects seen. He was particularly thrilled by motor cars 
and trucks and exclaimed at length about their colors, the names 
of which he apparently picked up very quickly. In other instances, 
he was able to see in full-blown detail, objects which, in the 
experience of the normal child, usually undergo a process of 
gradual alteration from crude beginnings. A normal child of 
three years sees a train as a group of moving boxes [drawn as 
rectangles] and it is perhaps not until the fifth or sixth year 
that the concept of train, visually experienced and visually 
acquired, may attain an approximation of reality. In the case 
of Loran these objects were seen all at once and entered upon 
association areas which were not already cluttered up with thousands 
of other visual impressions in a more or less adequate 
state of development. 

The other explanation concerns itself with the presence of 
matured drawing skill of average or better than average char- 
acter for the age involved. This explanation is supplementary. 
It assumes that there is some kind of morphological inheritance 
of stock in the subject’s case which lends itself readily to the 
easy acquisition of motor skills, without postulating any direct 
inheritance of a specific character but merely that of a 
constitutional nature. 

It has long been held by the author as a tentative hypothesis 
that art ability, other things being equal, is apt to be found in 
anybody whose ancestry includes individuals living through 
many generations in sections of the world where handicraft 
activities have long characterized the principal occupations. Such 
areas have been referred to by Giddings, or rather the type of 
people populating such areas, as familial aggregations. A considerable 
body of data collected by the author over a period of 
20 years strongly supports this view and explains in part why 
such high proportions of American artists, engravers, lithographers, 
instrument makers, and others are descendants of 
persons having lived in the upper Rhine country, Switzerland, the 
upper Danube, and other sections where handicraft activities are 
prominent. The selective factor is, of course, sexual: the most 
successful through many generations having attracted the better 
class of mate. Hence in descendants these same skills, which 
were, through the constitutional stock, undergoing a constant 
selection, now reappear in proficiency in fine carpentry work, 
cabinet making, general craftsmanship, furniture making and 
finishing, and the like. 

Inquiry into the antecedents of the subject produced informa- 
tion of the significant fact that an uncle on the father’s side was 
a finishing carpenter, cabinet-maker, and general repairman, or 
to quote directly from the source of the information, “I saw 
one kitchen he transformed into a perfect picture you see in 
magazines of modern kitchens.” Aside from the information 
that an uncle was a violinist, the information is meager. On 
his mother’s side the ancestry allegedly includes Indian blood. 
This part of the ancestry could not be traced occupationally. 
Since, however, some of the Indians of the southwest are artists 
of no mean ability, these facts do not necessarily militate against 
the acceptance of the hypothesis that there may even be some 
favoring morphological inheritance from this direction. In the 
absence of more definite facts it must be concluded, in line with 
causes regarding social and physical environment presented earlier 
in the article, that the inheritance of morphological character- 
istics, lending themselves to an easy facility in drawing activities, 
would seem to be the more likely explanation of this extraordinary 
case. There is some support for the belief that art ability 
is tied up at least that much, with inheritance of general con- 
stitutional patterns. Possibly the true explanation would include 
the possession of superior intelligence with fresh and vivid 
visual perceptions acting upon such a constitutional inheritance. 











AN INSTRUMENT FOR THE STUDY OF CREATIVE 
ARTISTIC INTELLIGENCE¹ 


by 


NORMAN C. MEIER AND WILLIAM McCLOY 


Creative mind, despite its unquestioned value to society, has 
only infrequently been an object of serious study. In the absence 
of adequate means for recognizing its early manifestations, 
society has had to depend upon fortuitous happenings to disclose 
the occasional genius. The creative artistic individual, it may be 
observed, is destined to be of incalculable importance in a world 
visually minded and so largely responsive to graphic stimuli, 
whether these be in the form of caricature, the satirical paint- 
ing, social mural, or the manifold appeal of advertising art in 
the poster, newspaper, periodical, or billboard. 

With a view toward studying the child and youth who has 
evinced relative superiority in some form of graphic art, in a 
way that would disclose possibilities of performance in ways and 
to an extent not heretofore feasible, the device described below 
was constructed. It is primarily an apparatus capable of dis- 
closing variation in individuals in their susceptibility to graphic 
organization. Such an apparatus could only indicate possibili- 
ties; it could not guarantee that they will be realized, any more 
than an intelligence test, having disclosed superior intelligence 
in an individual, could provide any assurance that he would use 
it. The device, nevertheless, is capable of providing a means 
for disclosing variations in creative ideas so great that no living 
person could exhaust them. 

Among teachers it is frequently held that the only way to 
discover ability is to have the person take the course in freehand 

1 Designed for use in the research program, Genetic Studies in Artistic 
Capacity, sponsored by the Carnegie Foundation for the Advancement of 
Teaching, with financial support from the Carnegie Corporation of New York. 

drawing. If he has ability he will show it. This procedure 
neglects the possibility that he may not be motivated to do his 
best and that his creative side may not be invoked at all by 
technical exercises. Others believe that no judgment can be 
rendered until such time has passed when, his training completed, 
he has had ample opportunity to ‘find himself.’ 

The entire picture is necessarily beclouded because of the fact 
that ‘creative’ is a term with many meanings. By some it is 
used when ‘productive’ is meant, as in the case of a student 
working on ‘Antique’ who is said to be doing ‘creative work.’ 
Another familiar use considers ‘creative imagination’ to be 
imagination that spins new forms out of material voids—out 
of ‘the mind’ rather than from experience. 

By ‘creative’ we shall refer to a special kind of organization 
of known forms, or more specifically, the arrangement or modification 
of ordinary materials or objects in such a way as to 
produce in the end a unique, esthetically significant, and generally 
interesting composition. 

Any peculiar or unorthodox arrangement is not necessarily 
‘creative art’ any more than the ‘baby talk’ of small children 
is literature. It must meet, to some degree, the common experience 
of mankind. That is to say, it must have attained at least 
one of the qualities of esthetic value which are integral with 
man’s existence, such as unity, balance, rhythm, or sequence. 
These of course are not there for their own sake—they serve 
to make the forms effective and moving, giving life and vitality 
to the organization. The creative artist attains this end ordinarily 
by automatized technical skills conditioned in varying 
degree by disciplined esthetic judgment. 

To those unacquainted with artists and their procedures in 
painting, the idea that many of them systematically and carefully 
plan every step in a composition would seem incredible. Many 
persons classified as ‘creative’ artists, work just that way. A 
given idea or theme for a painting may go through the pro- 
cedure of a ‘rough idea’ sketch; five or more preliminary 
‘organization’ sketches; brown paper and charcoal layout for 
final organization; charcoal development for values; detail 
sketches; and final layout and development on canvas. Each of 
these stages may be varied. It is in connection with the acceptance 
or rejection of each new phase of the organization that 
esthetic judgment is required on the part of the artist. On the 
wisdom or unwisdom of such decisions depends the progress 
toward success or failure of the composition. 

It must be observed that even in naturalistic representation 
there is always some opportunity for modification of the model. 
The flower may be made slightly redder, the vase somewhat taller; 
the mountain higher and the sky bluer or of arbitrary hue. In 
strictly creative art, however, the artist is bound only by the 
requirement that what he ends up with be esthetically moving— 
significant. He can manipulate line, color, form, at will—of 
course within the limits of good sense and the restrictions of 
esthetic values. 

The essential characteristic of the typical situation confronting 
the creative type of artist is variability. Not infrequently the 
opportunity for change, for rearrangement, for modification of 
features or detail, or any of the numerous focal points of manipu- 
lation, is virtually unlimited. Being so, the chances for error 
are multiplied; demands upon the judgment processes are thus 
increased. It is probably because of this that the creative artist 
is regarded in the category of the inventor and literary genius. 

An apparatus to offer possibilities for discovering individual 
variations in creative ability would need to provide, therefore, 
wide variability of manipulation. An apparatus was accordingly 
projected somewhat on the order of a miniature stage, which 
would offer variability in form, background, color and lighting 
(see Fig. 1). 

A table 25 in. high with a top area of 30 in. x 42 in. was built 
of maple. On this was placed a frame 30 in. high, 30 in. deep 
and 36 in. wide (outside dimensions). Across the lower front 
of the frame a battery of alternate red, yellow and blue lamps 
was installed and concealed from view by a tin reflector which 
directed the light into the frame. A similar battery of red, 
yellow and blue lamps² was installed inside the upper frame 


2 Experience in operation dictated change from the original color arrangements. 

area, likewise concealed from view and with the reflector directing 
the light within and slightly downward. Both of these batteries 
were built on movable supports which permitted the lights 
to be shifted. All lamps were 10 watt except 10 of the 20 blue 
lamps, which were 7½ watt. All red lamps were wired in 
series-parallel and led to a central control board where they 
were connected with the line and a Variac Type 200-B (1 ampere 
capacity) auto-transformer.³ At the control board arrangement 
was made for separate plug-ins for the upper and lower batteries 
so that either the upper or the lower bank could be used 
independently or both could be used at once. The yellow series 
and the blue series were similarly wired, except that the blue 
banks, having a maximum wattage of 175, were connected to a 
Variac Type 200-C (5 ampere capacity) auto-transformer. The 
low brightness value of the blue bulbs was not brought to the 
equivalence of the yellow by the addition of 10 extra lamps, but 
was regarded as fairly satisfactory for the purpose. 

Fig. 1. Creative composition apparatus. The background projector 
and photographing stand are not shown. 
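
The lamp loads quoted above can be checked with a few lines of arithmetic. The Python sketch below is an illustration only and rests on two assumptions not stated in the text: a line voltage of 115 volts, and a count of 10 ten-watt lamps in the yellow series (inferred from the remark that the blue series carried 10 extra lamps). The names used are hypothetical.

```python
# Assumption: nominal line voltage; the article does not state it.
LINE_VOLTS = 115.0


def bank_watts(lamps):
    """Total wattage of a lamp series given (count, watts_each) pairs."""
    return sum(count * watts for count, watts in lamps)


def line_amps(watts, volts=LINE_VOLTS):
    """Current drawn from the line at full brightness."""
    return watts / volts


# Blue series as described: 20 lamps, 10 at 10 watt and 10 at 7.5 watt.
blue = [(10, 10.0), (10, 7.5)]
# Yellow series: assumed to be 10 lamps of 10 watt each.
yellow = [(10, 10.0)]

blue_watts = bank_watts(blue)
print(blue_watts)  # 175.0, the maximum wattage given for the blue banks

# Compare each load with the 1 ampere rating of the smaller auto-transformer.
print(line_amps(blue_watts) > 1.0)          # blue exceeds 1 ampere
print(line_amps(bank_watts(yellow)) < 1.0)  # assumed yellow load stays below it
```

On these assumptions the blue series draws roughly one and a half amperes, which is consistent with its connection to the 5 ampere unit, while a 100 watt series remains within the 1 ampere capacity.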

By means of the series-wiring and the very sensitive auto-transformer⁴ 
any gradation of these primary colors was made 


3 General Radio Company, Cambridge, Mass. 

4 The auto-transformer differs from the ordinary rheostat principally in the 
tnagnitude of the steps in the reduction or increase of the current, the former 
being wound and constructed for extremely gradual changes. ‘ 
possible. By operating the dials of two or even three auto- 
transformers simultaneously an infinite variety of subtle color 
mixtures and intermediate colors can be obtained. The color of 
‘rosy dawn,’ ‘gray mist,’ ‘midday heat,’ ‘cold bluish snow,’ 
‘purple depths,’ ‘green vegetation,’ and practically any other 
color of nature or of compositional setting may be duplicated. 
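
How three dial settings combine into a mixture can be sketched roughly as follows. This is an illustration under stated assumptions, not a description of the authors' procedure: each dial is treated as a fraction of full line voltage, and the light output of an incandescent lamp is taken to vary as about the 3.4 power of the applied voltage, a common rule of thumb. The function names are hypothetical, and the sketch ignores the unequal full brightness of the three banks noted above.

```python
# Sketch: relative shares of red, yellow and blue light for given dial settings.
FLUX_EXPONENT = 3.4  # ASSUMPTION: rule-of-thumb exponent for incandescent lamps


def relative_flux(dial):
    """Light output relative to full setting; dial is a voltage fraction from 0 to 1."""
    if not 0.0 <= dial <= 1.0:
        raise ValueError("dial setting must lie between 0 and 1")
    return dial ** FLUX_EXPONENT


def mixture(red, yellow, blue):
    """Share of the total light contributed by each bank (shares sum to 1)."""
    flux = {
        "red": relative_flux(red),
        "yellow": relative_flux(yellow),
        "blue": relative_flux(blue),
    }
    total = sum(flux.values())
    if total == 0.0:
        # All dials at zero: no light, so every share is zero.
        return {name: 0.0 for name in flux}
    return {name: value / total for name, value in flux.items()}


if __name__ == "__main__":
    # A warm mixture: red at full, yellow at half setting, blue off.
    for name, share in mixture(1.0, 0.5, 0.0).items():
        print(f"{name}: {share:.3f}")
```

Because the assumed response is steeply nonlinear, a dial at half setting contributes far less than half the light, which suggests why very gradual control is useful in obtaining subtle mixtures.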

Opaque curtains of black sateen, hung by rings on wire, were 
mounted at the sides and across the front. These, however, 
could be adjusted to admit of full or partial opening. Inside, 
a small platform 4 in. above the table top and 18 in. x 25 in. 
in size was mounted on castors. On this were to be placed the 
clay-modeled forms for use in the studies to follow. 

Outside the frame, mounted on floor stands, were one ‘semi-flood’ 
lamp and four ‘baby-spots.’ The ‘semi-flood’ lamp⁵ 
was mounted on an adjustable stand that would permit project- 
ing a flood from above at a sharply acute angle or an intense 
side light at ‘ground’ level. This and all other outside light 
sources are portable and project light from almost any angle 
and at any given intensity.⁶ 

Three ‘pencil-spots’ were constructed⁷ from 5-cell flash- 
light cases, a 6 volt Jefferson transformer obviating the need 
for the usual dry-cells and permitting them to be wired together 
to an auto-transformer for the control of intensity. An inter- 
mediate or diffuse spot was built into a box, using a 2½ in. 
lens and 30 watt lamp. 

By fitting cellophane masks over the lenses on any of the 
portable spot-lights, a wide variety of color could be introduced, 
and, by such means, applied to any point or small area on the 
clay form within the frame. 

The ‘semi-flood’ light was connected to a 5 ampere auto- 
transformer, enabling the operator to suffuse the clay-modeled 

5 This lamp is a Major Baby Spot-light with 4 in. lens and 400-watt lamp, 
pre-focus. Because of the comparative Lilliputian proportions of the miniature 
stage, a commercial Baby Spot takes on the proportions of a semi-flood light. 
6 The ‘semi-flood’ up to 400-watts and the ‘pencil’ spots up to 100-watts 
maximum. 
7 By McCloy. The writers are indebted throughout to Professor Hunton D. 
Sellman of the Lighting Studio of the University Dramatic Arts Department 
for invaluable guidance in all lighting arrangements and selection of equipment. 
scene or objects with a soft, almost imperceptible glow or the 
full radiance of mid-day sun. The pencil-spots were useful for 
such purposes as casting a tinge of color on a modeled figure, 
illuminating a foreground, or throwing a purplish cast on a 
mountain cliff or peak or canyon wall. 

When desired placements had been effected, all of these lights 
were under control from the five dials of the transformers on 
the central control board. The operator could turn them at 
will, observing the result of subduing some, intensifying others, 
changing or modifying colors or creating accents or emphasis 
where desired. 


Back of the frame at a distance of 50 in. was placed a 1,000 
watt Spencer Delineascope⁸ mounted on a stand, the objective 
being 33 in. above the floor. The back of the frame was cov- 
ered by a sheet of vellum tracing paper, onto which any desired 
sky, cloud forms, trees or other background materials could be 
projected by means of the delineascope. Since the delineascope 
occupied a darkened recess in the room its projected image was 
seen in natural order from the front of the frame, that is, from 
the control board position. A background could thus be fused 
with the base form and lighting from all sources, into a unit 
composition of surprising fidelity and realism. It was thus 
feasible to get almost unbelievable beauty, and to produce effects 
that were strikingly like those in nature. 

In addition to the apparatus and equipment just described, all 
of which may be regarded as permanent, the base forms remain 
to be considered. It was planned to have within the frame, as 
a basis for each unit problem or object of study, a base-form 
which would be the nucleus of the general composition. 

These were built up of Plastaline Clay, which has the prop- 
erty of keeping its modeled character indefinitely and has the 
additional advantage of being readily reworked at any stage. 
It furthermore is available in a variety of colors.⁹ Synthetic 


8 Spencer Lens Co., Buffalo, New York. This one has a maximum picture 
opening larger than standard. 

9 That used was Plastalina, mostly No. 1 gray (F. Weber Co.), and 
Plastaline, green-gray (Favor Ruhl & Co.). Colors (Weber) were green, 
brown, yellow, terra cotta and blue. 
landscapes were constructed with a covering of clay over false 
structures (using small pieces of wood) somewhat after the 
fashion of constructing ‘sets’ on a movie lot. Figures were 
modeled in various poses for placement in groups or in various 
settings or for use in connection with different backgrounds. 
Strange creatures were modeled for experimental study of imagi- 
nation. Other objects were devised for the study of form 
perception. Grotesque characters and attitudinal groups were 
modeled for other purposes. 

All of these base forms were constructed upon five-ply fir 
board, 16 in. x 20 in. in size, and shellacked. In use the board 
was entirely covered with clay and when placed in the frame on 
the movable platform constituted a focal point for the play of 
lighting, background, and color illumination. The number of 
base forms is limited only by materials available and the prob- 
lems conceived. The number of problems applicable to each 
base form is limited only by the artistic ingenuity or creative 
imagination of the individual. 

By use of an adjustable photographic stand, Leica photo- 
graphs¹⁰ may be made from a vantage point that eliminates 
all surrounding accoutrements and apparatus. Only the scene 
itself shows and that can be made so nearly realistic as to deceive 
almost anyone. Plate I illustrates four compositions made with 
the apparatus and reproduced by the Dufay process. These 
constitute experimental materials now being used in a study of 
creative imagination by McCloy, to be reported later in 
Volume III of Studies in the Psychology of Art. The base 
forms are as follows: A-B Revenge motif (McCloy); C-D 
Picuris motif (Meier); E-F-G-H Earth Men motif (McCloy). 

It will thus be seen that by means of the device synthetic 
summer landscapes may be constructed in winter, cold winter 
scenes in summer, and any desired scene may be constructed, 
and then modified infinitely. That which may require days of 

10 Using No. 2 Supplementary Lens and Leica Tables. The photographic 
reproductions were first made on Eastman Panatomic and S. S. Panchromatic 
Film. By means of the recently perfected Dufay-process Color Film direct 
Leica photographs in full color are also made. These are then used by mounting 
directly between lantern slide cover-glass and by projecting in the usual manner. 





PLATE I 














actual tramping around in the field, exploring for some point of 
particular vantage may here be realized in the course of perhaps 
20 minutes of manipulation. Desert coloring for which an 
artist in the southwest may wait weeks may here be produced 
with approximate fidelity in the course of experimentation cover- 
ing perhaps 30 minutes—in addition, of course, to the time 
needed for modeling the mountains, cactus and arroyos. Light- 
ing effects on posed human models sought for hours without 
success may here be found in a short time. 

Aesthetic feeling states may be studied in several ways. Given 
a certain background, the individual could be required to select 
the most suitable or fitting base form,¹¹ or, given a background, he 
could be required to model his own appropriate base form, or 
select some grouping of figures to merge into the background. 
Another problem would involve lighting a given base form and 
background to enhance the mood already in the scene. The two 
ends of all artistic organization, unity and fitness,¹² can be 
studied by this device effectively in a multitude of ways. In 
more or less obvious ways creative imagination, form percep- 
tion, color balance, rhythm, creative composition, decorative 
design and other aspects of art may be studied by individuals. 
In turn, individuals may be studied through the preparation of 
problems in advance, in the form of photograph or colored 
reproduction, reactions then being secured by recognized scientific 
procedures. 

The usefulness of the device is not limited to research. Its 
teaching function is apparent.¹³ The effects of various color 
blends, solutions of lighting problems, attainment of unity by 
lighting, and a multitude of other useful sources could be demon- 
strated to a class. Experimenting with various possibilities of 
still-life arrangement or lighting of sculpture, by the student 


11 This is based on the assumption that ‘fitness’ in such a situation can be 
made a reality to the extent that artistic standards accepted by significant 
majorities and competent opinion would apply. 

12 So designated by the Whitford Committee. Report on Terminology. 
Federated Council on Art Education. Boston: Berkeley Press, 1929. 

13 On being shown the apparatus this was immediately seized upon as a 
distinct possibility by a prominent Woodstock and New York City landscape 
painter and teacher. 
himself, is also obvious. It is believed that in such a manner 
an inestimable amount of time would be saved, considerably 
more ground would be covered, and new organizations created 
that otherwise would have been unknown. 

The study of creative imagination in children is now in 
progress and will be reported in a subsequent volume. 